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It is a Pivotal Time for 


especially that it does not diminish the 
commitment across all educational lev- 


els to address the serious issues in K-12 | 
| the U.S. The annual Computer Science 


computer science education. 


Creating real, sustainable change in | 


any area of education is a frustratingly 
slow process. Public education is a com- 
plex bureaucracy where competing ide- 
ologies, philosophies, ontologies, and 
pedagogies vie for attention and con- 
trol. It is subject to extreme pressures. 
As a profession, teaching is simultane- 
ously professionalized and devalued. 
And, in most cases, the employees are 
overworked, underpaid, and _ severely 
under-resourced. 

But change is possible. Five years 
ago, ACM founded the Computer Sci- 


ence Teachers Association (CSTA) with | 


the goal of addressing serious concerns 
in K-12 computer science education, 
including the lack of curriculum stan- 
dards, poor professional development 
for teachers, common misunderstand- 
ings about computer science, student 
and parent perceptions that there are 
no jobs in the computing field, and the 


complete mess that is computer science | 


teacher certification. 

Today, CSTA stands as an example 
that faith, funding, and a whole lot of 
volunteer support from the top to the 
bottom can achieve something close 
to miracles. The guidelines in the ACM 


Model Curriculum for K-12 Computer Sci- 


ence are now recognized as the defacto 
national computer science curriculum 
standards. Thanks to its partnership 
with colleges and universities through 


K-12 Computer Science 


} 


we 


the JETT and TECS programs, CSTA has 
held more than 96 professional devel- 
opment workshops for teachers across 


& Information Technology symposium 
is the closest thing we have to an annual 
national conference for K-12 computer 
science and information technology 
educators. CSTA’s white paper The New 
Educational Imperative: Improving High 
School Computer Science Education pro- 
vides a cogent, research-supported ar- 
gument for the role that computer sci- 
ence must play in the K-12 academic 
canon. Careers in computing resources 
developed by CSTA and the ACM Edu- 
cation Board have now made their way 
into every school in the U.S. and sev- 
eral other countries as well. Finally, 
CSTA’s new report on teacher certifica- 


tion (Ensuring Exemplary Teaching in 
an Essential Discipline) proposes a new | 


framework for guaranteeing we have 
the teachers we require with the skills 
we need in our classrooms, 


But something equally important | 


came out of the enrollment crisis. Col- 
leges and universities facing dwindling 
class sizes began reaching out to K-12 
computer science educators and stu- 
dents in order to recruit students direct- 
ly into their programs, and in doing so, 
many faculty learned that the assump- 
tions they had been making about what 
it is like to teach computer science in 
K-12 were misguided at best and pater- 
nalistic at worst. 

Many colleges and universities now 


have ongoing outreach and mentoring | 
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programs because they understand that 
waiting for students to come to them is 
a recipe for disaster. Interest in comput- 
er science must begin long before stu- 
dents sign up for their first university or 
college courses. It begins in K-12 where 
other academic disciplines first sow the 
seeds of interest and engagement. 

This increased interest in direct 
outreach to K-12 has also prompted 
a greater level of understanding and a 
spirit of cooperation across educational 
levels that will benefit us greatly in the 
long run, but only if we do not lose sight 
of what we are doing and return to our 
former isolated complacency. 

The next year will be a pivotal one for 
us. Talk (and hopes) of a new, relevant, 
rigorous, and more engaging sequence 
of high school computer science courses 
(including a new gold standard Advance 
Placement computing course) and an 
ambitious plan for teacher professional 
development could mean a real renais- 
sance for our discipline and field. 

As we begin to work toward these 
goals, the challenges will be enormous 
as will the temptation to circle the wag- 
ons and fire inward. I hope our com- 
munity has the vision to rise above frag- 
mentation and discord so we can work 
together to do something important 
and valuable. We did it when we formed 
CSTA and I think we can do it again to 
reframe computer science education. 


Chris Stephenson (chris.stephenson@comcast.net) 
is the executive director of Computer Science Teachers 
Association 
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In CS Education, Educate the Educators First 


N THEIR POINT/COUNTERPOINT 
“CS Education in the U.S.: 
Heading in the Wrong Direc- 
tion?” (July 2009), Robert Dew- 
ar wrote that the CS curriculum 


lacks important fundamentals, mostly | 


of a mathematical nature, and Owen 
Astrachan wrote that “Studying more 
mathematics will not make software 
bugs disappear, although both [Edsger 
W.] Dijkstra and Dewar seem to think 
so,” concluding implicitly that more 
mathematics in the CS curriculum is 
not necessary. 

Astrachan apparently overlooked 
the distinction between necessary 
and sufficient conditions, a routine 
distinction in mathematics. As far as I 
know, neither Dijkstra nor Dewar ever 
claimed that mathematics is a “silver 


bullet” guaranteeing reliable design, | 


and both would likely agree with Fred 
Brooks that indeed there is no silver 
bullet. 

Dijkstra’s and Dewar’s argument 
that mathematics is an essential as- 


pect of a proper CS education mirrors | 


the fact that education in all engineer- 
ing disciplines involves a solid math- 
ematical foundation from the start, 
then used in all (other) courses; see my 
article “Teaching and Practicing Com- 
puter Science at the University Level” 
in Inroads, SIGCSE Bulletin 41, 2 (June 
2009), 24-30. 

As long as CS educators cannot 
agree about the fundamentals, prac- 
tice will remain below the professional 
standards common in classical engi- 
neering; see Allen Tucker et al.’s “Our 
Curriculum Has Become Math-Phobic” 
in SIGCSE Bulletin 33, 1 (Mar. 2001), 
243-247. This could be the result of 
declining CS student enrollment, pos- 
sibly leading to the replacement of 
mathematics with, say, trendy topics 
apparently more appealing to fresh- 


men. However, even trendy topics can | 


be combined with a solid mathemati- 
cal foundation, with the trendy topics 
included as illustrations. In any case, 
the emphasis in teaching such topics 
must be mathematical modeling, not 
mere description. 
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Unfortunately, teachers are divided, 
so combining theoretical CS expertise 
and practical experience in the same 
person is rare, unlike in other engi- 
neering disciplines. Educating the ed- 
ucators may well be a first priority. 

Raymond Boute, Ghent, Belgium 


Modular Programming 
Still a Challenge 
In Leah Hoffmann’s interview “Liskov 
on Liskov” (July 2009) Barbara Liskov 
recalled research being conducted at 
the time she was beginning her career 
(1970). “When I started,” she said, “the 
main way people thought about modu- 
larization was in terms of subroutines... 
But they didn’t have any way of linking 
a bunch of procedures together.” She 
also said (in Karen A. Frenkel’s news 
item “Liskov’s Creative Joy,” also July 
2009) “...the major challenge still is 
how to build large software systems.” 
The definition of “module” inevita- 
bly affects the building of large software 
systems, prompting our own recollec- 


tions and connections. For example, a | 


counterexample to the use of subrou- 
tines as the basic module existed in the 
form of a project that began in 1965 
at Argonne National Laboratory (ANL) 
and was reported at the Spring Joint 


| Computer Conference (May 1969) to 
develop the Argonne Reactor Compu- | 
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tation System (ARC) from linked com- 
putational modules running on IBM 
System 360 computers and OS/360. 
The modules were FORTRAN IV main 
programs where the output of one pro- 
gram was analyzed to become the input 
for other programs. Reactor designers 
wanted three main features: 
Superprograms. To link programs 
into superprograms directed by anoth- 
er FORTRAN IV main program while 
the original programs could still run on 


their own. They wanted all Assembler | 


code and any direct communication 
with the manufacturer’s software to be 
isolated for porting to other hardware. 
Initially, the most important Assem- 
bler code was for LINKing and LOAD- 
ing modules for coordinating the I/O 
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stream and communicating with JCL; 

New algorithms. To improve the 
modules through new-algorithm re- 
writes wherever possible; and 

New modules. To make it possible to 
build modules as required. 

ARC programmers each had five to 
10 years of experience (rare in 1965) 
and advanced degrees in mathematics, 
science, and engineering. They coded 
in Absolute and Assembler on ANL- 
built hardware (AVIDAC and GEORGE) 
and in FORTRAN on the IBM 704 and 
the CDC 3600. Because OS/360 was 
so unstable (as shipped in 1965), they 
needed to be able to decipher dumps. 
Completion of the project would have 
been delayed without programmers ca- 
pable of working close to the hardware 
and operating system. 

The ARC system later became the 
platform for ANL reactor calculations 
and was studied and used by other lab- 
oratories worldwide, though not before 
it was shown to be portable. 

On the first Earth Day (Apr. 22, 1970), 
a joint project with the Control Data 
Corporation aimed to port the ARC sys- 
tem to CDC hardware. We were aided 
in this effort by Richard Lee of CDC 
and Larry Atkins, a Northwestern Uni- 
versity engineering student also known 
for writing Chess 3.6, the winner of sev- 
eral ACM North America Computer 
Chess Championships in the 1960s 
and 1970s. Once the modular environ- 
ment was ported, the computational 
modules were easily ported. In fact, 
after one computational module was 
ported, the remaining work was almost 
automatic. 

Louis C. Just, Lakewood, CO 

Gary K. Leaf, Argonne, IL 

Bert J. Toppel, Argonne, IL 


A Quart of NFE Solution 
for a Pint CPU Problem 
In his article “Network Front-End Pro- 
cessors, Yet Again” (June 2009), Mike 
O'Dell seemed to be arguing what I say 
to colleagues in simpler terms, namely, 
that using a network front-end (NFE) 
processor is (and always has been) a 


bad idea for one basic reason: It must 
perform moderately complex compu- 
tation at least as fast as a computer’s 
“main” CPU, yet manufacturers insist 
it be cheap due to the cost “multiplier” 
effect when a computer includes many 
network links. To show that solving 
this problem is impossible, consider 
the following proof by contradiction: 
Assume an NFE processor with req- 
uisite attributes—moderately general- 
purpose, fast, and cheap—with most 
computer manufacturers using it as 
a main processor, not as a lowly NFE 


and so in need of even more network 
| Oracle database provides high-avail- 


bandwidth. 

Many computer engineers have 
long understood, as O’Dell wrote, that 
the most efficient way to implement a 
network-protocol software stack is to 
use one or more CPUs of an N-way SMP, 
but users strongly resist this idea when 
they discover they’ve paid big for what, 
from an application point of view, is 
only an N-1-way computer; note, too, 


popular “low-end” cases in which N = 2 or | 


4. Apparently, Sun Microsystems (“the 
network is the computer”) hasn’t left 
much of an impression on IT manag- 
ers. The result is that NFE startups 
continue to waste engineering talent 
by trying to pour a quart of technology 
into this particular pint jar. 
Scott Marovich, Palo Alto, CA 


How to Address Big Data 

I want to thank Adam Jacobs for cata- 
loging the important issues in the man- 
agement of large volumes of data in his 
article “The Pathologies of Big Data” 
(Aug. 2009). But please know that in- 


novations are also being made by rela- | 


tional database vendors. One recently 
released product is the HP Oracle Data- 
base Machine (http://www.oracle.com/ 
database/database-machine.html) that 
processes large volumes of data using 
massive horizontal parallelism across 
a shared-nothing storage grid. Pipe- 
line (vertical) parallelism is enabled by 
offloading data processing to a storage 
grid, so the amount of data that must 
be shipped back from the storage grid 
to the database grid is reduced as well. 
The storage grid and database grid are 
connected through a high-speed Infini- 
band interconnect. 

A single DM has an I/O bandwidth 
of 14GB/sec in the first version of the 


product (with uncompressed data). If 
the data is compressed, the effective 
bandwidth is much greater, depend- 
ing on compression ratio. (Jacobs’s ex- 
periment would have run much faster 


on the DM.) Multiple DMs can be con- | 


nected to increase data capacity, along 
with corresponding network/comput- 
ing capacity. 

Oracle’s Automatic Storage Man- 
agement (ASM) is an integral part of 


DM, providing automatic load bal- | 
ancing across all nodes in the storage | 


grid. ASM also provides fault-tolerance 
through dual or triple mirroring. The 


ability and disaster-recovery features, 
and ASM enables sequential I/Os 
through its allocation strategies (such 
as large allocations). 

One of the best ways to improve que- 
ry performance (assuming the most 
optimal access method is used) is to 
avoid I/O altogether. The Oracle data- 
base provides rich partitioning strate- 
gies that enable skipping large chunks 
of data that do not qualify as a query 
scan. 

Moving data from _ production 
(OLTP systems) to a specialized data 
store adds to a system’s total cost of 
ownership, as one would otherwise 
be managing two different data stores 
with poor or no integration. DM solves 
the big-data problem without a special 
data store just for data analysis; DM 
provides a single view of the data. 

Though I am a technical member of 
the Oracle Exadata development team, 


| my aim here is not to plug the product 


but report that the big-data problem is 
indeed being tackled, particularly by 
relational database vendors. 

Umesh Panchaksharaiah, Richmond, CA 


My Generation 
Samuel Greengard’s news story “Are 
We Losing Our Ability to Think Criti- 


cally” (July 2009) is inspiring as a basis | 


for future work. I am coordinating an 
interdisciplinary seminar on the collec- 
tive construction of knowledge (http:// 
seminario.edusol.info in Spanish), in- 
cluding two topics Greengard might be 
able to bridge: One is free software in 
a democratic society, inciting people to 
be more politically active and involved, 
despite being (usually) independent of 


political parties and other traditional | 
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means of shaping society. The other is 
how motivation and peer-recognition 
function in these communities; such 
free-culture communities have much 
in common with scientific communi- 
ties, despite starting off with complete- 
ly different motivations. 

My generation (born in the 1970s), 
including many people in the free soft- 
ware movement, has directly experi- 
enced the great shift computing and 
networking have brought the world, 
fully embracing the technologies. The 
greatest difference between people 
who are just users of computing and 
those striving to make it better de- 
pends on who has the opportunity to 


| appropriate it beyond, say, the distrac- 


tion level, the blind Google syndrome, 
or the simple digestion of “piles of data 
and information [that] do not equate to 
greater knowledge and better decision 
making.” 

Thanks to Greengard for sparking 
some useful thoughts. 

Gunnar Wolf, Mexico City 


Communications welcomes your opinion. To submit a 
Letter to the Editor, please limit your comments to 500 
words or less and send to letters@cacm.acm.org, 
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Can Automated Agents : 
Negotiate with Humans? — 


MapReduce and Parallel 
Database Management 
Systems: Friend or Foe? 


MapReduce: A Flexible Data 
Processing Tool 


What Should We Teach New 
Software Developers? 


Next-Generation Search 
Cross-Language Translation 
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computer museums, and technology 
strategy and management.  . 
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CS Woes: Deadline- 
Driven Research, 
Academic Inequality 


Jeannette M. Wing writes about the negative effects 
of deadline-driven research and Mark Guzdial discusses 
the role of computer science faculty in fostering inequality. 


From Jeannette M. 
Wing's “Breaking the 
Cycle” 

How can we break the cy- 


search? In computer sci- 
ence, there has been a growing trend 
in the past decade or so for researchers 
to publish in workshops and confer- 
ences in order to increase the length 


of their publication list. This situation _ 


is especially true of junior faculty, wor- 
ried about getting tenure; graduate 
students, worried about getting job 
interviews; and now even undergradu- 
ates, worried about getting into gradu- 
ate school. In promotions and tenure 
committee meetings at some schools, 
discussion of the number of papers 


can overshadow discussion of the qual- | 


ity and impact of the candidate’s work. 

We have successfully trained deans 
and provosts that, in computer sci- 
ence, papers in premier conferences 
count as much as or more than papers 
in journals. So, the pressure to publish 
in conferences is even that much more 
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cle of deadline-driven re- | 


DECEMBER 2009 


intense. And presumably the more the 
better. To accommodate the research 
capacity of our field, new workshops 


and conferences (and journals) prolif- | 
| they feel ready, they can submit based 
sites maintaining double-digit rank- | 


erate, resulting today in extensive Web 


ings of conferences. It is now common 
practice to see the conference success 
rate included with each publication 
listed on a candidate’s résumé. (I will 
not repeat the cogent arguments that 
others have given on the subject of 
journals vs. conferences, published 
in Communications,'* but they are rel- 
evant to this topic as well.) 

We are now in a state where our ju- 
nior faculty are mentoring graduate 
students with this deadline-driven ap- 
proach to research. It’s the only value 
system they know and they are passing 
it onto the next generation. When one 
of my own graduate students, after we 


agreed that we would submit a journal — 
version of our conference paper, saidto | 


me, “Jeannette, the author guidelines 
for Journal X don’t specify a page lim- 
it,” I knew something was very wrong 
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with our current culture in computing. 
We are now in a state where the com- 
mon thought-chunk of research is a 
12-month effort that fits in 12 pages. 
We, as faculty advisors, are in a 


_ bind: Do we say to our student, “Yes, go 


ahead and submit to that conference 
[whose due date is looming]” or “No, 
don’t waste your time writing for that 


_ conference. Your work is not ready. 


Spend the time developing the work”? 
Do we give in to the peer pressure our 
students feel, making them potentially 
less competitive when they are on the 
job market? We need to promote a cul- 
ture that encourages faculty and stu- 
dent researchers to take the time need- 
ed to work out their ideas so that when 


on the import of their contribution. 

Moreover, conservatism tends to 
win out in program committees, when 
submissions are competing for a fi- 
nite number of conference slots, and 
in panel reviews for funding agencies, 
when proposals are competing for fi- 
nite resources. This attitude leaves less 
room for the bold, creative, risk-taking, 
visionary ideas, especially those that 
are not fully fleshed out with all the i’s 
dotted and t’s crossed. Note that I have 
nothing against conferences: they are 
important for the expeditious exchange 
of technical ideas, as well as network- 
ing among researchers and between 
academia and industry. I have noth- 
ing against (high-quality) incremental 
research: some research agendas are 
long-term in vision, but rely on making 
progress step by step, building on prior 
research results. 


The consequences of this deadline- | 
driven research are potentially bad for | 
the field. Our focus should be on the 
quality of the research we do. Our goal 
should be advancing the frontiers of 
science and engineering. 

So how can we break this cycle? One | 
place to start is with the department | 
heads. At hiring time, among other fac- 
tors, we should look for a candidate’s 
big idea (or two), not number of publi- 
cations. In mentoring junior faculty, we 
need to stress the importance of quality 
and impact. At faculty evaluation time, 
we should promote and grant tenure 
based on quality and impact. 

Hopefully, we in the community can 
at least start a dialogue on this topic. It 
is for the good of our field—to keep it 
healthy, exciting, and vibrant. 
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Reader’s comment 
A key challenge that our community needs 
to address is how to detect, from an ocean 
of papers, the key innovative ones that 
need to be widely distributed. The new 
Communications is a very good step in this 
direction, but I think that Communications 
will not be sufficient and we'll need 
innovative techniques to efficiently and 
quickly detect the most innovative papers 
in each CS subfield. 

—Olivier Bonaventure 


From Mark Guzdial’s 
“CS Faculty Cause 
Inequality” 
An article in the Sept. 8, | 
2009 edition of the New 
« — York Times argues that 
“Colleges Are Failing in Graduation 
Rates.” Specifically, the article claims 
“t]he United States does a good job 
enrolling teenagers in college, but only 
half of students who enroll end up with 
a bachelor’s degree. Among rich coun- 
tries, only Italy is worse. That’s a big | 
reason inequality has soared, and pro- 
ductivity growth has slowed.” 

That’s a strong claim, that the fail- 
ing rates in college are actually caus- 
ing inequality. This isn’t the first time 


_ high school in greater numbers. It’s 
_ when they get to college that they be- 


“It’s easy to pick 

the ‘best and 
brightest’ who look 
like us, act like us, 
and learn like us. 
The challenge is to 
identify the students 
who are even brighter 
and better than us, 
but don’t look 

like us, act like us, 
or learn like us.” 
—Mark Guzdial 


the Times has made this argument, 
though. The first time I read the claim 
that higher-education faculty are a 


| significant cause of the widening gap 


between the haves and have-nots was 
in a column in 2005 by David Brooks 
of the New York Times. He explicitly ar- 
gued that colleges, rather than beinga 
ladder to improving one’s life, are ac- 
tually reducing the opportunities for 
the poor. Brooks wrote: 

“As you doubtless know, as the in- 
formation age matures, a new sort of 
stratification is setting in, between 
those with higher education and those 
without. College graduates earn nearly 
twice as much as high school gradu- 
ates, and people with professional 
degrees earn nearly twice as much as 
those with college degrees. But worse, | 
this economic stratification is translat- | 
ing into social stratification. Only 28% 
of American adults have a college de- 
gree, but most of us in this group find 
ourselves in workplaces in social mi- 
lieus where almost everybody has been 
to college... The most damning indict- 
ment of our university system is that — 
these poorer kids are graduating from | 


gin failing and dropping out. Thomas 
Mortenson of the Pell Institute for the 
Study of Opportunity in Higher Educa- 
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tion has collected a mountain of data 
on growing educational inequality. As 


| he points out, universities have done 


a wonderful job educating affluent 
kids since 1980. But they ‘have done 
a terrible job of including those from 
the bottom half of the family income 
distribution. In this respect, higher 
education is now causing most of the 
growing inequality and strengthening 
class structure of the United States.’ ” 
CS faculty play a role in this phe- 
nomenon. Noonewholooks at CS1 fail- 
ure rates could argue that CS doesn’t 
contribute to failing grades. Richard 


_ Tapia, in his forward to Jane Margo- 
_ lis’ Stuck in the Shallow End, makes the 
argument explicitly starting from his 


title, “Computer Science is Widening 
the Education Gap.” He wrote: 

“Over the years, I have developed 
an extreme dislike for the expression 
‘the best and the brightest,’ so the au- 
thors’ discussion of it in the conclud- 
ing chapter particularly resonated with 
me. I have seen extremely talented and 
creative underrepresented minority 
undergraduate students aggressively 
excluded from this distinction. While 
serving on a National Science review 
panel years back, I learned that to be 
included in this category you had to 
have been doing science by the age of 
10. Of course, because of lack of oppor- 


| tunities, few underrepresented minori- 


ties qualified.” 

No one is arguing that we should not 
seek the actual “best and the bright- 
est.” The real question is how we make 
that determination and how we devel- 
op those students. It’s easy to pick the 
“best and brightest” who look like us, 
act like us, and learn like us. The chal- 
lenge is to identify the students who 
are even brighter and better than us, but 
don’t look like us, act like us, or learn 
like us. With declining enrollment and 
a population of computer scientists 
who increasingly have the same gender 
and represent only a few ethnic groups, 
we must look beyond the simple defi- 
nitions, find those terrific students 
whom we might have missed in our 
first glance, and help them to develop 
in the ways that best suit them. 


Jeannette M. Wing is a professor at Carnegie Mellon 
University. Mark Guzdial is a professor at the Georgia 
Institute of Technology. 
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ACM and its Partners are Proud to Announce 


Computer Science Education Week 


To encourage increased participation in 
computer science at all educational levels 


December 6 -12, 2009 


Join the Conversation at: 
CSEDWEEK.ORG 


Announcing ACM's New 
Career & Job Center! 


Are you looking for your next IT job? Do you need Career Advice? 


Visit ACM's newest career resource at http://www.acm.org/careercenter 


The ACM Career & Job Center offers ACM members 

a host of career-enhancing benefits!: 

- A highly targeted focus on job opportunities in the computing industry 
- Access to hundreds of corporate job postings 


+ Resume posting — stay connected to the employment market and maintain full control over 
your confidential information 


+ An advanced Job Alert system that notifies you of new opportunities matching your criteria 


+ Live career advice to assist you in resume development, creating cover letters, 
company research, negotiating an offer and more 


The ACM Career & Job Center is the perfect place to 
aie begin searching for your next employment opportunity! 
Visit today at http://www.acm.org/careercenter 


Advancing Computing as a Science & Profession 


ual extension 


in the vir 


In the Virtual Extension 


Communications’ Virtual Extension brings more quality articles to ACM 
members. These articles are now available in the ACM Digital Library. 


Power and Trust in Global 
Virtual Teams 


Niki Panteli and Robert Tucker 


Although the current understanding of 
virtual teams has advanced in significantly 
over the last few years, the authors contend 
it has not taken sufficient account of | 
power dynamics within virtualteams nor | 
sought to explore the nature of power | 
within geographically distributed teams. 
The challenge is to be able to manage | 
power differentials effectively in order 

to allow collaboration to foster within a 
virtual team environment. The question 
addressed in this article is: How is power 
exercised in global virtual teams and how 
can it effectively impact trust development 
and overall team performance in such 
distributed environments? 


Online Privacy, Government 
Surveillance, and National ID Cards 


Sun Sun Lim, Hichang Cho, and 
Milagros Rivera Sanchez 


The authors explore how the online privacy 
concerns of Internet users are related 

to their attitudes toward government 
surveillance and national ID cards. A survey 
of Internet users in five multinational cities 
(Bangalore, New York, Seoul, Singapore, 
and Sydney) found that while positive 
attitudes toward ID cards raise online 
privacy concerns, actual experiences 

using these cards appear to desensitize 
Internet users to privacy intrusions. This is 
possibly due to the perceived convenience 
of these cards, the realization that privacy 
intrusions are not particularly significant, 
or a sense of resignation that the use of ID 
cards will inevitably be accompanied by 
privacy intrusions. 


Security in Dynamic Web Content 
Management Systems Applications 


Ganesh Vaidyanathan and Steve Mautone 


Web Content Management Systems 
(WCMS) allow teams to maintain Web 
content in a dynamic fashion. While this 
“on-the-fly” content creation provides 
Web site authors several advantages, 
there are distinct disadvantages. Indeed, 
organizations are adopting information 
technology without understanding such 
security concerns. The authors illustrate 
how to evaluate open source systems and 
how an evaluation technique in terms of 


_ Lou Hafer and Arthur E. Kirkpatrick 


| for the evaluation of the scholarly 


_ of the application of discoveries. The 
| authors contend software instantiates 


| poor risk management, and lack of staff 
rewards. The failure factors are different 


security may be used in an organization 

to assess a short list of possible WMCS 
systems. This work focuses on security 
issues in WCMS with the objective to 
understand the security issues as well as to 
provide a generic security framework. 


Assessing Open Source Software 
as a Scholarly Contribution 


In academic computer science, 
papers about software are considered 
scholarship but actual software is not. 
The authors propose “best practices” 


contribution of open source software, 
raising publication of code to a status 
comparable to publication of the ideas 

it embodies. Evaluating software as 
scholarship acknowledges the importance 


knowledge in a form that can be applied, | 
unlike the passive knowledge in an article. — 
Considering software as scholarship 

also advances the scientific principle of 
reproducibility. Pseudo-code in an article 

is insufficient for replication; only actual 
code provides enough detail. 


Why Did Your Project Fail? 
Narciso Cerpa and June Verner 


Most research literature on failed software 
projects tends to deal with a few high- 
profile failures. The authors review 

70 failed projects to determine what 
factors lead to project failure. Data was 
collected from a survey that considered 
over 80 software development practices. 
Projects do not fail for a single reason 
alone; they fail for multiple reasons, 
including poor estimation and schedule, 


for in-house and outsourced projects, 

and while organizations do not conduct 
post-mortem reviews software projects will 
continue to fail. 


Visual Passwords: 
Cure-All or Snake-Oil? 


Karen Renaud and Antonella De Angeli 


In our everyday lives we’re expected to 
remember a number of passwords and 
PINs. Since human memory is finite and 
fallible, we often forget them. As an 
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industry, we know that the password has 
outlived its usefulness as an authenticator, 
but no one has come up with a convincing 
and viable alternative. Visual passwords 
have been researched for a decade now. 


| The authors present the different kinds of 


visual passwords, their pros and cons, and 
debate whether they do indeed possess the 


| potential to be a viable alternative to the 
| hated password. 


Positive Externality, Increasing 
Returns, and the Rise in 
Cybercrimes 


Nir Kshetri 


The distinctive geography of cyberspace 
provides an ideal environment for engag- 
ing in opportunistic behavior. This article 
employs increasing returns and externali- 
ties approaches to explain the escalation 
of cybercrimes. The author focuses on 
three positive or self-reinforcing feedback 
systems to examine increasing returns in 
cybercrime-related activities. They are re- 
lated to economic, sociopolitical and cog- 
nitive systems. The author also examines 
three mechanisms that may give positive 
feedback to cybercriminals: inefficiency 
and congestion in the law enforcement 
system, acceleration of the diffusion of 
cybercrime know-how and technology, and 
increase in potential criminals’ predisposi- 
tion toward cybercrimes. 


Technical Opinion: 


| Are Employees Putting Your 


Company At Risk By Not Following 
Information Security Policies? 


Mikko Siponen, M. Adam Mahmood, 
and Seppo Pahnila 


| Careless employees who do not follow 


information security policies constitute a 
serious threat to information privacy and 
confidentiality. A field research, conducted 
to determine which issues are vital toward 


| employees’ compliance with these policies, 


indicates that the visibility of security 
activities and expectations of peers have 

a positive impact on employees’ ability 

to assess the severity of security threats. 
These precursor variables also affect 
employees’ beliefs that the security policies 
compliance is an effective way to combat 


_ these threats. In turn, if employees realize 


the severity of security threats, they often 
have a strong intention to comply with 
security policies. 
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Crowdsourcing and 
the Question of Expertise 


There is an in inherent weakness to 
crowdsourcing that should bother 
computer scientists and computer 
users alike. It’s the fact there is no 
clear difference between “the wis- 
dom of the crowd” and “the mob that 
rules.” What’s missing is a measure 
of discernment. 

The Internet is awash in informa- 
tion that demands selectivity, leading 
Newsweek among others to predict 
the rise of online experts and reli- 
able information (http://www.news- 
week.com/id/119091). The assess- 
ment seems overly optimistic. There 
are some efforts to rate expertise on 
the Internet (http://cacm.acm.org/ 
news/42206), but most of us are left with coping strategies that limit where you 
go, what you see, and who you trust. It is not the kind of open investigation that 
promotes learning or understanding. 

Crowdsourcing doesn’t really help sort through or synthesize information, in 
fact, it might do the opposite. Research shows that it favors popular opinion and 
therefore reinforces homogeneity (http://cacm.acm.org/news/42525). That’s not 
hospitable to unconventional or idiosyncratic views. 

There is an upside, for sure. Luis von Ahn’s GWAP (http://www.gwap.com/ 
gwap/about/) uses computer games “to solve problems for humans all over the 
world.” And Galaxy Zoo tapped about 250,000 visitors to classify nearly one mil- 
lion galaxies (http://cacm.acm.org/magazines/2009/10/42492). 

Now the downside: The limitations of crowdsourcing are becoming appar- 
ent, even to its defenders. Blogger Josh Berkus summarizes key weaknesses, 
saying the term is “evil” and carries too much baggage (http://it.toolbox.com/ 
blogs/database-soup/never-say-crowdsourcing-34331). In the end he concludes 
that the problem is mainly about improper usage. But the issue is bigger than 
that. The problem with crowdsourcing is that there is no verity. In fact, “cor- 
rectness [is]...anathema to crowdsourced systems” (http://cacm.acm.org/maga- 
zines/2009/7/32094). That’s a small concern when rating movies, but researchers 
and scientists need something more. 

Science needs higher standards. This was illustrated by Newsweek when it de- 
cried science education in the U.S. and showed how “wisdom of the masses” is an 
oxymoron. It described how John Holdren, director of the White House Office of 
Science and Technology Policy, trades candor for political timidity when discuss- 
ing science policy (www.newsweek.com/id/216505). “He must sell his ideas to 
people who couldn’t pass high-school algebra—and who believe they know more 
than he does.” 

Crowdsourcing empowers followers. It risks weakening leaders. 
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ACM 
Member 
News 


HALL WINS DUNCAN 
DAVIES MEDAL 

ACM President Dame Wendy 

Hall received the Duncan Davies 
Medal, which is awarded annually 
by the Research and Development 
Society to an individual who 

has made an outstanding 
contribution toward making 

the United Kingdom the world’s 
best-performing research and 
development environment. 


BERMAN HONORED WITH 
KEN KENNEDY AWARD 
Francine 
Berman was 
awarded the 
inaugural Ken 
Kennedy Award 
‘i from ACM and 
oe IEEE Computer 
Society for “her influential 


| leadership in the design, 


development, and deployment 
of national-scale cyber- 
infrastructure.” A vice president 
for research at Rensselaer 
Polytechnic Institute, Berman 
was recognized for her work as a 
pioneer in grid computing anda 
leading advocate for the 
development of a national-scale 
cyberinfrastructure for the 
access, use, stewardship, and 
preservation of the digital data. 
In an email interview, 
Berman discussed the current 
challenges and opportunities 
in cyberinfrastructure. “There 
are immense opportunities 
that focus on the development 
of cyberinfrastructure to drive 
innovative solutions for some 
of the most complex and 
compelling societal challenges 
of our age: health care, energy, 
the environment, safety, and 
economic stability,” said 
Berman. “Some of the greatest 
breakthroughs we are now 
seeing in these areas come from 
the innovative use of computers, 
information, sensors, networks, 
scientific instruments, and other 
21st century tools. The challenge 
is to develop a system to support 
and deploy cyberinfrastructure 
as infrastructure: sustainable 
business models, appropriate 
standards, low-barrier-to-access 
user interfaces, and inter- 
operability. The development 
of cyberinfrastructure as 
infrastructure truly constitutes a 
grand challenge for our age.” 


News 
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Blueprints for 
Self-Assembly 


Researchers are using tools from information theory and computer 
science to facilitate the automatic creation of nanoscale structures. 


ELF-ASSEMBLY, BY Which 
atoms, molecules, or other 
nanoscale components 
spontaneously organize 
into something useful, 
sounds so simple. Just mix a few chem- 
icals and wait for a new plastic, drug, 
or electronic component to form at the 
bottom of your test tube. 

Unfortunately, it’s not so easy. Coax- 
ing tiny particles to arrange themselves 
in an orderly way, with desirable and 
repeatable properties, is enormously 
complex, typically involving a great 
deal of trial and error in the laboratory. 

But now two mathematicians, us- 
ing tools from information theory and 
computer science, have found a new 
and relatively simple way to orches- 
trate the assembly of nanostructures. 
And they have devised algorithms that 
can produce mathematical proofs that 
their structures are optimum. 

Henry Cohn, a principal researcher 
at Microsoft Research New England and 
Abhinav Kumar, an assistant professor 
of mathematics at Massachusetts In- 
stitute of Technology, have employed a 
rich mix of techniques—including heu- 
ristic algorithms, linear programming, 


search optimization, and error-correc- | 


The Schlegel diagram for a regular 120-cell 
structure (with a dodecahedral facet in red) 
from Henry Cohn and Abhinav Kumar's self- 
assembly research. 


tion theory—to produce their results. 


Writing in a recent issue of Proceed- | 


ings of the National Academy of Sciences, 


Cohn and Kumar describe their suc- | 
| cess in designing a system to direct 20 


randomly placed particles on a sphere 
to form into a perfect dodecahedron 
with 12 pentagonal faces, a structure 
that minimizes potential energy and, 
hence, maximizes stability. 

Although the methods have yet to 
be implemented in a lab, they may ul- 
timately find use in such diverse fields 
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| as electronics, communications, and 

medicine. For example, Cohn says, a 
| drug company might produce a time- 
release drug by encapsulating tiny drug 
droplets in structures, such as dodeca- 
hedrons, that have certain desired prop- 
erties. The idea is to simplify the search 
for the proper materials and conditions 
for self-assembly of such things. 

The forces between particles deter- 
mine whether or not they will organize 
into a stable and desired configura- 
tion. Cohn and Kumar's blueprints 
for self-assembly specify the required 
_ inter-particle forces and distances via 
| formulas called potential functions. 

Traditional approaches to this 
problem use complex potential func- 
tions with multiple potential wells, lo- 
cal energy minima that cause the par- 
ticles to settle into certain positions. 
But Cohn and Kumar devised a way to 
find potential functions that cause the 
particles to organize themselves more 
directly, without relying on local min- 
ima. The resulting formulas are sim- 
pler and, hence, would be much easier 
to implement in a lab or manufactur- 
ing line, they say. 

In the second part of the problem, 
Cohn and Kumar go on to mathemati- 
cally prove that the dodecahedron 
and several other structures built by 
_ their methods are in fact the unique 

ground states, or globally energy-min- 
imizing arrangements of particles. 
They do that using linear program- 
ming bounds, a tool borrowed from 
error-correction theory. 

The researchers find their optimal 
potential functions using an iterative 
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process they call “simulation-guided 
optimization,” which alternates be- 
tween molecular dynamics simulation 
and linear programming. A trial poten- 


tial function is chosen, and a number | 
of simulations—with random starting | 


positions for the particles—are run, 


allowing the particles to interact until | 


they settle into a structure. The result is 
a list of possible candidate structures 
for the ground state. 


Then linear programming looks for | | 


a new potential function that makes 


all the candidates rank worse than the | 
target configuration, and the whole | 


process is repeated. The researchers 
call this process a heuristic algorithm 
because “it is not guaranteed to work.” 
They tested their potential function 
1,200 times with different starting po- 
sitions of the 20 particles, and all but 
six converged to the dodecahedron. 
Not surprisingly, a process that re- 
petitively applies simulation and linear 
programming can be computationally 
very taxing. Indeed, a large part of the 
mathematicians’ effort went into find- 
ing efficient search methods. “The 
problem is how not to get stuck at some 
sub-optimal solution,” Cohn says. 
Graphs of potential functions 
against inter-particle distances for 


Princeton University professor of chemistry 
Salvatore Torquato, who pioneered an 
inverse approach to self-assembly. 


| Cohn/Kumar solutions tend to be 
smooth, with particle interactions de- 
creasing monotonically with distance. 
But those for the more conventional ap- 
proaches—those that employ potential 
wells—are complex and bumpy, with 
/ numerous local maxima and minima. 

“The problem with potential wells is 

they are much harder to manufacture 

in the lab,” Kumar explains. “But the 
' kinds of functions you see in nature, 


Report from the ACM Nominating Committee 


and the kind you might be able to gen- 
erate, are basically functions with not 
too many wiggles. We ask the question, 
‘Can you get a potential function with 
essentially no wiggles?’ ” 


The Inverse Approach 

Cohn and Kumar’s work builds on ear- 
lier research by Salvatore Torquato, 
a professor of chemistry at Princeton 


_ University. Starting with a paper writ- 


ten four years ago, Torquato pioneered 
what he calls the inverse approach to 


| self-assembly. In the traditional for- 


ward approach, known particle interac- 
tions are used to predict a likely result- 
ing structure. But the inverse method 


| starts with some desired configuration 


and derives the optimal inter-particle 
interactions that would spontane- 


| ously organize into that target struc- 


ture. Torquato has used pure theoreti- 
cal work as well as numeric computer 
simulation to find potential functions 
that can lead to the self-assembly of 
materials into squares, honeycombs, 
diamond shapes, and lattices. 

“This is a completely different way 


| of thinking about designing these 


structures, and it’s tailor-made for self- 
assembly,” he says. 
Torquato says the inverse approach 


Slate of Nominees for ACM General Election 


In accordance with the 
Constitution and Bylaws of the 
ACM, the Nominating Committee 
hereby submits the following slate 
of nominees for ACM’s officers. 

In addition to the officers of the 
ACM, two Members at Large will 
be elected. The names of the 
candidates for each office are 
presented in random order below: 


PRESIDENT (7/1/10-6/30/12): 


SECRETARY/TREASURER 
(7/1/10-6/30/12): 


Alexander L. Wolf, 
Imperial College London 


| Carlo Ghezzi, 
Politecnico di Milano 
MEMBERS AT LARGE 
(7/1/10-6/30/14): 
Vinton G. Cerf, 


elected offices of the ACM may 

| also be nominated by petition 
of one percent of the Members 
who as of November 1 are eligible 
to vote for the nominee. Such 
petitions must be accompanied 
by a written declaration that 
the nominee is willing to stand 
for election. The number of 
Member signatures required 
for the offices of President, Vice 


Segal, c/o ACM Headquarters. 

All candidates nominated by 
petition are reminded of the 
requirements siated in the Policy 
and Procedures on Nominations 
and Elections that a candidate for 
high office must meet in order to 

| serve with distinction. Copies of 
this document are available from 
Rosemary McGuinness, Office 


Alain Chesnais, 
SceneCaster.com 


Joseph A. Konstan, 
University of Minnesota 


VICE PRESIDENT (7/1/10-6/30/12): 


Barbara G. Ryder, 
Virginia Tech 


Norman P. Jouppi, 
Hewlett Packard 
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Fei-Yue Wang, 


| Chinese Academy of Sciences- 


Institute of Automation/ 
University of Arizona 


Satoshi Matsuoka, 
Tokyo Institute of Technology 


Salil Vadhan, 
Harvard University 


The Constitution and Bylaws 
provide that candidates for 


DECEMBER 2009 VOL. 52 NO. 12 


President, Secretary/Treasurer 
and Members at Large, is 687. 
The Bylaws provide that such 
petitions must reach the Elections 
Committee before January 31. 
Original petitions for ACM offices 
are to be submitted to the ACM 
Elections Committee, c/o Pat Ryan, 
COO, ACM Headquarters, 2 Penn 
Plaza, Suite 701, New York, NY 
10121, USA, by January 31, 2010. 
Duplicate copies of the petitions 
should also be sent to the Chair of 
the Elections Committee, Gerry 


of Policy and Administration, 
ACM Headquarters. Statements 
and biographical sketches of all 
candidates will appear in the May 
2010 issue of CACM. 

The Nominating Committee 
would like to thank all those who 
helped us with their suggestions 
and advice. 

Stuart Feldman, Chair, Fabrizio 
Gagliardi, Susan Graham, Mathai 
Joseph, Lucy Sanders. 


IVERSITY, OFFICE OF COMMUNICATIONS 


PHOTOGRAPH BY BRIAN WILSON, PRINCETON UN 


driven by theoretical and computation- 
al methods is the most direct way to de- 
sign materials that can be produced by 
self-assembly. Some such substances 
have counterintuitive properties, such 
as materials that shrink when heated 
or that, when stretched, expand at 
right angles to the direction of stretch. 
The latter property might be useful in 
a foam designed to provide a tight seal 
between two surfaces, for example. 
“The game is to show that in fact you 
are not limited to what you get from mo- 


lecular interactions,” Torquato says. | 


“What kinds of interesting structures, 
that typically nature doesn’t make, can 
you make computationally? We ask 
the fundamental questions: What’s 
permitted? What’s not? We still don’t 
know the answers to those questions.” 

While researchers using the inverse 
method increasingly rely on concepts 
from information theory, those em- 
ploying the forward approach are more 


Edisonian, relying more on hands- | 


on, trial-and-error laboratory work, 


Torquato says. “They synthesize new 
[substances] and they get what they get 
at the end of the day,” he says. “But the 
most direct way to get self-assembly is, 
OK, you want a particular target config- 
uration, then let’s design the interac- 
tions to get those.” 

But Torquato says it’s not a battle 
between Edisonians and computer sci- 
entists. “The idea would be to combine 
our computational theoretical approach 
with experimental synthesis technol- 
ogy, and that, I think, is the way material 
science will be done in the future.” 

Torquato hails as a “wonderful con- 


eres 
The research of Cohn 


and Kumar may 


have applications 
for fields such 

as electronics, 
communications, 
and medicine. 


repel each other and stay as far apart as 
possible. Cohn says he was able to ap- 
ply some of the theory of error-correc- 
tion coding to the self-assembly prob- 
lem by using it to tell particles how to 
stay far apart. 

“Some of the most interesting ar- 
eas for research are where there is an 
imbalance between two fields, where 
each field knows something that the 
other field hasn’t figured out yet,” 
Cohn says. “If you can make the con- 
nection, you can engage in arbitrage 
and transfer some information in each 
direction.” 


eee 


— 


RI 


Further Reading 


Cohn, H. and Kumar, A. 
Algorithmic design of self-assembling 
structures, Proceedings of the National 


| Academy of Sciences 106, 24, June 16, 


tribution” Cohn and Kumar’s methods | 


for mathematically proving theorems 
about their potential functions. That 
was feasible, he said, because the spaces 
they considered, suchas spheres, were fi- 
nite and bounded. “Can their approach 
scale to Euclidian space?” he asks. 
“There is no reason to think one can’t do 
that. Until they came along, there was 
no rigorous proof of any kind.” 

Cohn and Kumar found inspiration 
in some unlikely recesses of informa- 
tion science, such as the theory be- 
hind the error-correction codes used 
in communication systems. In a noisy 
communication channel, one wants to 
keep multiple signals as well separated 
as possible so they don’t get confused. 
Similarly, in chemistry, particles often 


2009. 


Cohn, H. and Kumar, A. 
Universally optimal distribution of points on 


| spheres, Journal ofthe American Mathematical 


Society 20,1, Sept. 5, 2006. 


Torquato, S. 

Inverse optimization techniques for 
targeted self-assembly, Soft Matter 5, 6, 
March 21, 2009. 


Rechtsman, M. C., Stillinger, F. H., Torquato, S. 
Synthetic diamond and Wurtzite structures 
self-assemble with isotropic pair 
interactions, Physical Review E 75, 3, id 
031403, March 26, 2007. 


Cohn, H. and Kumar, A. 

Counterintuitive ground states in soft-core 
models, Physical Review E 78, 6, id. 061113, 
Dec, 12, 2008. 


Gary Anthes is a technology writer and editor based in 
Arlington, VA. 
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Milestones 


CS Awards 


IBM’s Blue Gene supercomputers 
and members of the computer 
science community were 
honored for their innovative 
research. 


NATIONAL MEDAL OF 
TECHNOLOGY AND 
INNOVATION 

Ina special ceremony at the 
White House, President Obama 
honored IBM’s Blue Gene line 
of supercomputers with the 
National Medal of Technology 
and Innovation in recognition 
of its lasting contributions to 


_ America’s competitiveness, 


standard of living, and quality 
of life through technological 
innovation. 


JIM GRAY eSCIENCE AWARD 
University of California, Santa 
Barbara 
professor 

Jeff Dozier 

was awarded 
Microsoft 
Research’s 
second annual 
Jim Gray eScience Award. 
Dozier was cited for “his 
pioneering research on remote 
sensing, water resources, 

and climate change, and his 
contributions to the integration 
of environmental science and 
computer science.” 


SEYMOUR CRAY AWARD 
IEEE Computer Society bestowed 
its 2009 Seymour Cray Computer 
Engineering Award to Kenichi 


_ Miura, a professor at the National 


Institute of Informatics in Tokyo, 
for “unique contributions to the 
field of computer engineering by 
bringing a strong background 

in numerical algorithms and 
applications to the task of 
designing systems that deliver 
high performance on real 
scientific applications.” 


SIDNEY FERNBACH AWARD 
IEEE Computer Society presented 
the 2009 Sidney Fernbach Award 
to Roberto Car and Michele 
Parrinello, developers of the Car 
Parrinello Molecular Dynamics 
approach. Car, a professor of 
chemistry at Princeton University, 
and Parrinello, a professor of 
computational science at ETH 
Zurich, were recognized for laying 
“the foundation for a modern 
approach to the chemistry and 
physics of materials. Their 
methodology was revolutionary, 
increasing the speed of 
simulations and propelling a 
major force in science.” 
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Ready for a Web OS? 


Anew generation of browsers may finally herald the long-awaited 
convergence of the Web and operating system. 


ACK IN 1995, Netscape co- 
founder Mare Andreessen 
predicted that his fledgling 
Web browser would one 


day render Windows obso- 


lete. Fifteen years later, Netscape is | 


long gone, and the traditional desk- 
top operating system (OS) remains 
firmly established on most personal 
computers. Meanwhile, Web brows- 
ers still look a lot like they did in the 
mid-1990s, running inside applica- 


tion windows. In hindsight, Andrees- | 


sen may have spoken a bit too soon. 
But history may yet prove him right. 


The hegemony of the desktop OS | 


is starting to fracture with the emer- 
gence of a new generation of browsers 
that may finally herald the long-await- 
ed convergence of Web and OS. An 
enormous amount of Web OS devel- 
opment is currently under way, with 
the development of Web standards, 
such as HTMLS, to add richer capa- 
bilities and features; new technolo- 
gies like Microsoft's Xax and Google’s 
Native Client that make browsers and 
their applications as capable and 
powerful as desktop applications; 
and architectural changes to brows- 
ers, making them process oriented, 
which increases their robustness and 
security. 

A Web OS offers enormous prom- 


ise. Potentially, it could take the best | 


of the Web—the rapid deployment 
and updating of new applications, de- 
vice independence, and the ease and 
convenience with which large com- 
munities can collaborate and share 
information—and combine it with 
the advantages of desktop applica- 


tions—operating at machine speed, | 


rich and interactive interfaces, and 


access to local hardware—and side- | 


step many of the security and compat- 
ibility issues currently plaguing desk- 
top OSs. Before the Web OS becomes 
a practical reality, however, browser 
developers must overcome several 
major obstacles to security and de- 
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vice integration that continue to tilt 
the balance of power in favor of the 
desktop OS. 

For many average computer us-_ 
ers, the browser has become their de | 
facto OS—a tool of choice for e-mail, 
personal finance, and other activities 
that were once the domain of desktop 
applications. Today’s Web has comea 
long way from its original incarnation 
as acollection of passive, hyperlinked 
documents. Web developers now rou- | 
tinely use sophisticated scripting lan- 
guages and other active client-side 
technologies to provide users with 
rich experiences that approximate 
the performance of desktop applica- 
tions, including features like drag- 
and-drop, keyboard shortcuts, and 
other desktop-like affordances that 
have become commonplace. 

The latest Web browsers include 
powerful features that further close 
the gap between Web applications | 


_ and native desktop applications. Most 


major browsers have significantly in- 
creased the speed of their JavaScript 
engines, allowing more complex and 
computationally demanding applica- 


Web Page Instance 


Access control | 
and audit log 


| 


To improve the performance and security of 
the OP Web browser, the main architecture 
is divided into five main subsystems: 
browser kernel, storage subsystem, network 
subsystem, user-interface subsystem, and 
Web page instances. 
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tions to be developed. And Web appli- 
cations will soon benefit from evolved 
Web standards, such as HTMLS5, fea- 
turing offline support, local storage, 
geolocation capabilities, graphics ac- 
celeration, and perhaps access to cli- 
ent devices, such as a scanner or video 
camera. 

“With HTMLS5, we're going to see a 
new generation of rich Web applica- 
tions,” says Adam Barth, a postdoctor- 
al fellow at the University of California 
at Berkeley who focuses on privacy 


_ and browser security. “But I suspect 


it will take a while for application de- 
velopers to realize the full potential of 
the various HTML5 technologies like 
canvas, local storage, and video.” 

In a similar vein, experimental 
technologies, such as Xax and Na- 
tive Client, allow Web publishers to 
implement Web programs as native 
x86 code that executes directly and 
safely on the client’s processor, elimi- 
nating the interpretation or compila- 
tion overhead of scripted or byte-cod- 
ed languages and frameworks such 
as Java, JavaScript, Silverlight, and 
Flash. 

“T think we’re going to see a pro- 
liferation of different scripting lan- 
guages in the browser,” says Barth. 
“In the past, JavaScript had a mortal 
lock as the lingua franca of the Web, 
but now, with technologies like Xax 
and Native Client, anyone can write 
an interpreter for their favorite lan- 
guage and run the interpreter in the 
browser without pestering the user to 


| install the interpreter.” 


Sam King, assistant professor in 


_ the computer science department at 


the University of Illinois at Urbana- 
Champaign, thinks JavaScript is too 
entrenched in today’s Web ecosystem 
to cede its dominant position anytime 
soon. “JavaScript is tightly integrated 
into modern browsers and a funda- 
mental part of the current Web,” he 
says. “However, as technologies such 
as Xax and Native Client show up in 


BROWSER", PROC 


browsers, Java, Silverlight, and Flash 
will certainly have new competition.” 


Multiple Concerns 

As Web sites take advantage of im- 
proved client-side _ technologies, 
browsers will need to start coping with 
a growing range of performance, reli- 
ability, and security concerns. “As we 
make Web browsers more powerful, 
we need to keep in mind how mali- 
cious Web sites might abuse that addi- 
tional power,” says Barth. For example, 
a truly secure browser should let users 
visit sites that contain buggy or mali- 
cious code without fearing about the 


integrity of their OS, applications, or | 


private data. 

To address these concerns, re- 
searchers are reconsidering the un- 
derlying architecture of the Web 


browser. New multiprocess browsers, | 


such as Tahoma, Google Chrome, In- 
ternet Explorer 8, and the OP browser, 
place separate Web applications in 
their own operating system processes 
or virtual machines, allowing the un- 
derlying host operating system or vir- 
tual machine monitor to ensure that 
crashes and slowdowns in one Web 
application do not affect the perfor- 
mance or robustness of other applica- 
tions. “By decomposing the browser, 
you can separate out the security logic 
from the implementation,” says King. 

To improve security, browsers such 
as Chrome and Internet Explorer 8 
have been refactored to run untrusted 
components and Web code in a low- 
privilege sandbox, limiting the expo- 
sure of browser vulnerabilities and 
the damage that can be inflicted by a 
malicious Web application. Mozilla is 
exploring a similar protection mecha- 
nism in its Electrosis project that will 


strengthen security features in a fu- | 
ture version of Firefox. Meanwhile, | 


Microsoft Research developers have 
taken this approach astep further with 
the prototype Gazelle browser, which 
separates different Web sites into dis- 
crete protection domains. With the OP 
browser, researchers at the University 
of Illinois at Urbana-Champaign ex- 
plored applying OS principles to Web 
browser design by breaking the brows- 
er program into smaller subsystems. 
By isolating processes within a 
browser, browsers can mediate inter- 
actions between Web programs, such 


_ improved client- 


—— 
As Web sites take 
advantage of 


side technologies, 
browsers must 
cope witha 
growing range 

of performance, 
reliability, and 
security issues. 


as mashups and embedded widgets, 
to prevent one program from stealing 
information from or causing damag- 
ing side effects to other programs. 
However, the techniques that accom- 
plish this often come at the expense 
of backward compatibility, making 
them more difficult to deploy. 

Looking further ahead, questions 
of compatibility will continue to arise 
as browsers must negotiate a growing 
tangle of local computing resources 
such as offline storage, cameras, mi- 
crophones, geolocation, and graphics 
acceleration hardware—all of which 
have their own security and perfor- 
mance issues. 

Geolocation provides an instruc- 
tive example of the types of challenges 
that browser developers will likely face 
in the near future. “Geolocation is an | 
interesting case from a security point | 
of view because we’re resorting to ask- 
ing users to grant Web sites addition- 
al privileges to read location data,” 
says King. Developers must negotiate 
delicate trade-offs in giving users an 
appropriate level of control without 
letting them also damage their sys- 
tems. “Developing a security policy 
for accessing local data and hardware 
is still an open, difficult, and impor- 
tant research problem,” he says. 

The continued proliferation of 
devices—each with its own OS and in- 
terface—may provide further impetus 
toward a consistent, predictable Web | 
OS. With new devices like the Palm 
Pre smartphone, the CrunchPad Web 
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tablet, and various netbooks running 
Google’s forthcoming Chrome OS, 
all user interaction will take place 
through a browser or Web-based ap- 
plications. While each of these devic- 
es may still have a different OS pro- 
viding a scaffolding of background 
processes, users will increasingly 
experience these devices through the 
filter of a Web interface. 

As developers take advantage of 
these emerging technologies to craft 


| Web-based experiences across a grow- 


ing range of devices, the OS will likely 
continue to recede from users’ aware- 
ness—and perhaps eventually disap- 


| pear altogether. But even if the tradi- 


tional OS sticks around in some form 
for years to come, it may not matter 
much to anyone except developers. 
“I’m not sure users care that much 
about the computing platform,” says 
Barth. “Users seem to care much more 
about what they can do with technol- 
ogy than how it gets done.” 
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Making Automation Work 


Today’s automated systems provide enormous safety and convenience. 
However, when glitches, problems, or breakdowns occur, the results can be catastrophic. 


1’S NO SECRET that engineers 

and designers constantly seek 

to build safer and more con- 

venient systems. And, over the 

last century, planes, trains, 
automobiles, and industrial machines 
have become far more automated and 
efficient. However, when a Metro sub- 
way train rammed into another train in 
Washington, D.C. last June, designers 
had to confront the unpleasant real- 
ity that automation may have been the 
cause. The accident, which killed nine 
people and injured 80, may have been 
rooted in a computer malfunction and 
the operator’s inability to manually ap- 
ply the brakes quickly enough. 

The Metro train accident lies at the 
heart of what human factors experts re- 
fer to as the “automation paradox.” As 
automated systems become increas- 
ingly reliable and efficient, the more 
likely it is that human operators will 
mentally “switch off” and rely upon the 
automated system. And as the auto- 
mated system becomes more complex, 
the odds of an accident or mishap may 
diminish, but the severity of a failure is 
often amplified. 

As John D. Lee, a professor of indus- 
trial and systems engineering at the 
University of Wisconsin at Madison 
told the Washington Post: “The better 
you make the automation, the more 
difficult it is to guard against these cat- 
astrophic failures...” 


Understanding how people and ma- | 


chines interact is infinitely complex. 
Programming all the various possibili- 
ties and scenarios into a system can 
tax even the best design and engineer- 
ing experts. What’s more, as technol- 
ogy evolves, the entire process grows 
more convoluted and iterative. In some 
cases, experts say, it’s wise to ask what 
purpose automation serves and when 
it’s best to use it and eschew it. 

What is the fallout from automa- 
tion glitches? Where do programmers, 
designers, and engineers typically fall 
short? And what can technologists do 
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The June 22, 2009 Metro subway train crash in Washington, D.C., in which nine people died 
and 80 were injured, is the deadliest accident in the Metro's 33-year history. 


to build better systems? There are no 
simple solutions. But as Donald Nor- 
man, professor of computer science at 
Northwestern University, co-founder of 
Neilson Norman Group, and author of 
The Design of Future Things, says, “De- 
signers often make assumptions or act 
on incomplete information. They sim- 
ply don’t anticipate how systems will 
be used and how unanticipated events 
and consequences will occur.” 


Human-Machine Interface 

It’s clear that automation has provided 
enormous gains to society. Safer and 
more efficient factories; faster police, 
emergency, and fire response; and 
more user-friendly and safer automo- 
biles are only a few of the benefits. 


fluid flow velocity, may have became 
blocked by ice. At that point, they may 
have stopped emitting signals, and ex- 
perts say that the pilots could have en- 
countered false speed readings. In fact, 
the jet—which was coping with a series 
of storms, including a severe thunder- 
storm—reportedly relayed a signal that 
its computer system no longer knew 


_ the speed of the aircraft, and that auto- 


Yet, at the same time, it takes little | 


effort to find evidence of breakdowns 
between human and machine. 

The crash of Air France Flight 447 
that occurred over the Atlantic Ocean 
last June—killing all 228 people 
aboard—may have been caused by a 
malfunction in a speed sensor. The 
plane’s Pitot tubes, a pressure mea- 
surement instrument used to track 
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matic pilot and thrust functions were 
switched off. This may have forced the 
pilots to take over manual control dur- 
ing chaotic, if not impossible, flying 
conditions. 

There are also plenty of examples 
of humans having trouble with auto- 
mation systems in everyday life. As au- 


| tomobiles become more automated, 


new problems crop up. For instance, 
motorists blindly follow the incorrect 
directions provided by a navigation 
system, even though a glance at the 
road would indicate there’s an obvious 
error. A few motorists have even driven 
off a cliff or into oncoming traffic after 
following directions explicitly. What’s 
more, studies show that many motor- 
ists use automation features, such as 


AP PHOTO 
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adaptive cruise control, incorrectly. In 
some cases, Norman says, these auto- 
mated systems cause the car to speed 
up as motorists exit a highway because 
there’s suddenly no car in front. If a 
driver isn’t paying attention, an acci- 
dent can occur. 

In the case of airplane pilots and 
train operators, one solution is regular 
training sessions in which the pilot or 
operator is required to turn off their 
automated system and operate every- 
thing manually. This can help them re- 
tain their skills and alertness. 

But even this is not likely to elimi- 
nate breakdowns. Human-machine 


interface failures occur for a number | 


of reasons, experts say. Sometimes, de- 
signers rely on a wrong set of assump- 
tions to build a system. They simply 
don’t understand the way people use 
technology or the cultural differences 
that occur. In some instances, thou- 
sands and sometimes millions of vari- 
ables exist and capturing everything 
in a single algorithm is exceedingly 
difficult. In fact, Norman argues that 
machine logic doesn’t necessarily jibe 
with the human brain. “If you look at 
‘human error’ it almost always occurs 


when people are forced to think and | 


act like machines,” he says. 

Worse, complex algorithms often 
prompt humans to relate to devices as 
if they were fellow human beings. As 


a result, the autopilot on a plane, the | 


cruise control on a car, and automated 
speed-control systems in mass transit 
become either aids or crutches, de- 
pending the situation. 

Too often, the sum of a system is not 
equal to the individual parts, says Sid- 
ney W. A. Dekker, director of research at 
the Leonardo da Vinci Center for Com- 
plexity and Systems Thinking at Lund 
University in Sweden. “There is often a 
great deal of human intuition involved 
in a process or activity and that’s not 


something a machine can easily du- | 


plicate,” says Dekker. “If you look at 
delivering babies, there’s a reason we 
have midwives and nurses. Machines 
can monitor and help, but they can’t 


detect subtle signs and they’re unable | 


to adapt to situations as seamlessly.” 
David D. Woods, professor of cog- 
nitive engineering at Ohio State Uni- 
versity, says that designers can eas- 
ily succumb to the trap of thinking 
“a little more technology will solve 


the problem.” However, understand- 
ing variables and identifying possible 
exceptions and disruptions is para- 
mount. For example, when the Metro 
D.C. train crashed, it may have been 
due to wet leaves on the tracks and a 
computerized system that wasn’t pro- 
grammed for such a scenario. “The 
automation system functioned as it 
was designed,” Woods says. “The situ- 
ation simply fell outside the model of 
what engineers envisioned.” 


Beyond Failure 

Make no mistake, human factors ex- 
perts constantly scrutinize automa- 
tion. Many believe that if human error 
exists, it falls on the shoulders of those 
engineering, designing, and program- | 
ming technology. “In reality, there is no 
such thing as operator error. Too often, 
systems aren’t designed as whole and 
those creating them overlook impor- | 
tant factors,” argues Nancy Leveson, 
professor of aeronautics and astronau- 
tics at Massachusetts Institute of Tech- 
nology and author of the forthcoming 
book Engineering a Safer World. 

Yet, progress is taking place. Con- 
sider the airline industry: In 1989, 
1.4 crashes per 1 million departures 
occurred. By 2008, the number had 
dropped to 0.2 fatal accidents per 1 
million departures. In fact, crashes 
have steadily dropped over the de- | 


cades while survivability has increased. | . 


Dekker, who is a pilot and has flown | 
various aircraft, including a Boeing 
737, says that the industry has got- 
ten serious about stamping out flaws, 
bugs, and oversights. 

These improvements have taken 
place because the airline industry has 
moved beyond studying ergonomics | 
and discreet processes. In fact, Leveson 
says that researchers have put a micro- 
scope to cognitive functions, psycholo- 
gy, cultural issues, and a variety of oth- 
er components that comprise human 
factors. “They have evolved toward a 
system view and worked to understand 
how everything—hardware, software, 
procedures, and humans—interact. | 
It’s a model that other industries must | 
embrace,” she says. 

One thing is certain: Automation | 
disconnects won’t disappear anytime | 
soon. Leveson believes that, ultimate- | 
ly, the people designing systems must — 
take a more holistic view and get past | 


DECEMBER 2009 


VOL. 52 


News 


the notion that when a problem or 
breakdown occurs it’s a result of “hu- 
man error.” She believes that univer- 
sities must place a greater focus on 
human factors and that programmers 
and others must understand that, with- 
out a big picture view of what they are 
building, the end result will continu- 
ally fall short. 

Others, such as Dekker, argue that 
society must examine larger issues, 
including whether automation auto- 
matically translates into progress. “In 
reality, not every function or process 
is best automated,” he says. “In some 
cases, automation simply creates new 
or different tasks and doesn’t provide 
any real benefit.” Automation may also 
change processes to the point where 
people are more confused and entirely 
new social dynamics take place. At that 
point, he says, designers may attempt 
to add new features, which only ratchet 
up confusion and complexity further. 

To be sure, imperfect people con- 
tinue to build imperfect systems. The 
need to focus on human-machine in- 
terfaces has never been greater. “De- 
signers, engineers, programmers, and 
others must take an expansive view of 
automation and understand all the 
possibilities and variables,” concludes 
Norman. “Only then can we build sys- 
tems that improve performance and 
solve real-world problems.” 
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Problem Solvers 


This year’s Grace Hopper Celebration focused 


on using technology for social good. 


VER THE PAST 15 years, the 

Grace Hopper Celebration 

of Women in Computing 

has become one of the in- 

dustry’s premier forums for 
womenincomputerscience. Cofounded 
in 1994 by Anita Borg and Telle Whitney 
and inspired by the legacy of Grace Mur- 
ray Hopper, the conference balances a 
broad range of technical talks with pro- 
fessional and personal programs. Once 
a modest gathering of several hundred 
women, it has since grown into a large, 
four-day affair—held this past October 
in Tucson, AZ—with a tightly packed 
schedule and more than 1,600 attend- 
ees from 23 countries. 

Yet the experience remains both 
powerful and intimate. “There’s al- 
ways a lot of excitement,” says Valerie 
Barr, chair of the computer science de- 
partment at Union College. “There's 
an air of joy and celebration.” Barr 
has attended all but one of the previ- 
ous nine conferences. Like many at- 
tendees, Barr cites the friends she has 
made as one of the primary reasons | 
she comes back year after year. These 
friendships are actively encouraged 
by the conference leadership, who 
challenged women to introduce them- 
selves to five new people each day and 
harnessed social media like Facebook, 
Twitter, and blogs to spread the mes- 
sage. Last year’s innovative CONNECT 
program, which enables attendees 
to use special scannable bar codes to 
automatically exchange contact infor- | 
mation with people, was back again to 
facilitate further networking. 

This year’s theme was “Creating 
Technology for Social Good,” and pan- 
els, papers, and speeches showcased 
diverse examples of collaboration and | 
accomplishment. Keynote speakers 
Megan Smith, vice president of new | 
business development at Google.org, 
and Francine Berman, vice president 
for research at Rensselaer Polytechnic 
Institute, spoke about increasing Inter- 
net availability in developing countries 
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From left, clockwise, Carleton University Ph.D. candidate Natalia Villanueva-Rosales, ACM 
President Dame Wendy Hall, and Google.org Vice President Megan Smith. 


and harnessing data to further projects 
in science and engineering. Other tech- 
nical topics included human-centered 
design, communications security and 
policy, and wireless ad hoc networks. 
“Grace Hopper demonstrates that you 
can approach computer science in a 
way that’s relevant to people—that ad- 
dresses hunger, poverty, environmen- 
tal issues, and so on,” explains Barr. 
Because of the breadth of its scope, 
Barr says, the conference is also a great 
opportunity to learn about breaking re- 
search that’s outside her specialty. 

One of the year’s best-received new 


| features was a technical track that was 


devoted to robotics. “It spanned four 
sessions on a single day and seemed to 
work quite well,” says conference chair 
Heidi Kvinge, an Intel software engi- 
neering manager. 

Students comprise nearly half of the 
attendees, and much of the non-tech- 
nical program revolves around profes- 
sional development. “We always focus 
on the pipeline,” says Kvinge. “We want 
to build the next generation of lead- 
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ers.” There were résumé clinics, leader- 
ship workshops, and panels on issues 
women face in industry and academia. 
The Computer Research Association’s 
Committee on the Status of Women 
sponsored three career-building ses- 
sions for undergraduates, grad stu- 
dents, and early-career researchers. 

Industry representatives were also 
invited to set up booths in an exhibi- 
tion hall to showcase their companies 
and answer questions. “Grace Hopper 
is a great place to recruit people,” says 
Tessa Lau, a researcher at the IBM Al- 
maden Research Center. The women 
at Grace Hopper, says Lau, are moti- 
vated, smart, and enthusiastic about 
the field. The conference, meanwhile, 
gives them a unique opportunity to talk 
to corporate researchers rather than 
professional recruiters. 

“I always come back energized and 
full of ideas,” says Kvinge. “It broadens 
my vision.” 


Leah Hoffmann is a Brooklyn-based technology writer. 
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Broadening Participation 
Opening Remarks 


Highlighting efforts and providing the rationale to increase the participation and 
success of underrepresented groups in computing. 


HIS IS THE first in a new 


Communications Viewpoints | | 


Broadening Participation 


edited by me. The purpose 
of the column is inform, invigorate 
conversation, and inspire action in the 
computing community concerning is- 
sues in broadening participation. This 
column will tend to be U.S.-centric be- 
cause broadening participation in the 
U.S. is what I know best. What I mean 
by broadening participation in the U.S. 
is increasing the inclusion of individu- 
als from underrepresented groups, 
such as women, Native Americans, Af- 
rican Americans, Hispanics, and per- 
sons with disabilities, in the comput- 
ing field at all levels. In other countries 
and regions of the world underrepre- 
sented groups would be defined differ- 
ently. I distinguish “broadening par- 
ticipation” from “diversity,” which is a 
much broader concept. Diversity refers 
to the variety of backgrounds and life 
experiences of individuals including 
geographical, cultural, economic, and 
other differences, regardless of under- 
representation. The computing field 
has tremendous diversity. As an ex- 
ample, more than half of the computer 
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science doctorate degrees in the U.S. 
are earned by people from countries 
other than the U.S. 


My Qualifications 

Readers might wonder why I am quali- 
fied to edit a column on broadening 
participation: I am a white male, not 
' a minority and not disabled. Nonethe- 
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University of Washington Computer Science Professor Richard Ladner, left, signs with 
some of the participants in the first Summer Academy for Advancing Deaf and Hard of 
Hearing in Computing, held during the summer of 2007. 


less, I have a particular sensitivity to 
issues involving broadening participa- 
tion because of my life history as a son 
of deaf parents who were part of the 
deaf community. You might not think 
of deaf people as a minority group but 
the construct of oppressed minority 
group for the deaf community does fit 
in many ways, as brilliantly described 


in Harlan Lane’s book The Mask of Be- 
nevolence: Disabling the Deaf Commu- 
nity. It is an often-overlooked histori- 
cal fact that in 1880 the International 
Congress on Education of the Deaf es- 
sentially banned sign language from 
schools for the deaf worldwide in favor 
of the oral method, which stressed only 
lip-reading and speech. At that time 
there was no realization that sign lan- 
guage was a natural language with its 
own temporal/spatial grammar that 
was efficient and complete in its visual 
modality. Those who banned sign lan- 
guage were not deaf, but believed they 
knew what was best for deaf people by 
promoting their integration into the 
hearing world. 

My parents were very successful as 
teachers of deaf children and in most 
other aspects of life. They were bilin- 
gual, fluent in American Sign Lan- 
guage and English. They were able to 
lip-read and their speech was distort- 
ed, but they were highly educated with 
a remarkable command of English in 
written form. Nonetheless, they always 
felt the sting of paternalism when oth- 
ers treated them as if they couldn’t take 
care of themselves or were not qualified 


for jobs held by hearing people who | 


they knew were less qualified. Future 


columns will explore people with dis- | 
abilities in computing in more depth, | 


but in this first column my intent is to 
provide readers with an understanding 
of background informing my perspec- 
tive on broadening participation. 


Reasons for Broadening 
Participation 

There are a number of reasons why 
broadening participation is important 
to the computing field, the most signif- 
icant of which include numbers, social 
justice, and quality. 

Numbers. In spite of several down- 
turns in the economy since the inven- 
tion of computers and the dot-com 
bust in the early 2000s the demand for 
computer professionals worldwide has 
continued to grow, sometimes extreme- 
ly rapidly and other times more slowly, 
but always growing. The computing 
field cannot continue to rely on getting 
the vast majority of its high-tech work- 
ers from a few demographic groups. As 
an example, the America Competes Act 
of 2007 has provisions and authorizes 
spending to help increase the number 


mote 
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U.S. Ph.D. demographics by gender, race and ethnicity, and disability. 


Computer Science 


Women 21% 


STEM 
40% 


U.S. Population 
50% 


African American 2% 


2% 12% 


Native American 0% 


0% 1% 


Asian 12% 


4% 4% 


Hispanic 1% 


3% 15% 


White (Non-Hispanic) 29% 


42% 74% 


Disabled 1% 


1% 15% 


Non-resident Alien 56% 


of women and minorities in science, | 


technology, engineering, and math 
(STEM) fields. 
Social Justice. An important princi- 


ple in a democratic society is equality, — 
that is, everyone enjoys the same rights | 
and is obligated by the same responsi- | 


bilities. The vast public education sys- 
tem in the U.S. attempts to provide as 


_ many children as possible with a basic 
| education to help them become con- | 
_ tributors to the social well being and | 


to learn their rights and responsibili- 


_ ties as citizens. Through laws enacted 


by legislatures and interpretations of 


26% 6% 


only masks our biases with a cloak of 
rationalization. Everyone has biases 
depending on their own upbringing 
and circumstances. Acting on those 
biases consciously or unconsciously 
may be discrimination, some of which 
could break anti-discrimination laws. 
The Project Implicit (https://implicit. 
harvard.edu/implicit) is a good starting 
point to learn about implicit bias and 
to take a test that demonstrates your 
own implicit biases. 

Quality. The strongest reason for 
broadening participation is quality. 


_ The argument is similar for why diver- 


laws by courts, the U.S. adheres to the | 


principle of equal opportunity whereby 
citizens should not be discriminated 
against because of gender, minor- 
ity status, or disability. Preventing dis- 
crimination does not guarantee pro- 
portional participation, but it does at 
least level the playing field. 

There is well-documented evidence 
of implicit bias against women and mi- 
norities in academia.'” Those of us in 
computer science are typically highly 
rational; perhaps having the belief 
that our reasoning power makes us 
immune to bias. Our reasoning power 


ere 
I distinguish 
“broadening 
participation” 

from “diversity,” 
which is a much 
broader concept. 
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sity is important to quality. William W. 
Wulf, the former president of the Amer- 
ican Academy of Engineers, explained 
this very well: “I believe that engineer- 
ing is a highly creative profession. Re- 
search tells us that creativity does not 
spring from nothing; it is grounded in 
our life experiences, and hence limited 
by those experiences. Lacking diversity 
on an engineering team, we limit the 
set of solutions that will be considered 
and we may not find the best, the ele- 
gant solution.”* 

Better solutions are more likely to 
emerge if there is a diversity of points of 
view contributing to the solutions. This 
is one of the reasons most major com- 
panies try to diversify their work forces: 
it gives them a competitive advantage. 
Broadening participation helps our 
field because it brings in more people 
with a variety of backgrounds, and at 
the same time levels the playing field 
for those who are traditionally under- 
represented. 


Demographics 
It is important to understand the cur- 
rent circumstances with broadening 


| participation, particularly at the end 
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of the pipeline when students receive 
their Ph.D.s. The table here gives some 
approximate percentages for com- 
puter science Ph.D.s, STEM Ph.D.s, 
and the U.S. population. The computer 
science numbers, except for disabled, 
come from the 2008 Taulbee Survey 
(see http://www.cra.org/CRN/articles/ 
may09/taulbee.html), the STEM num- 
bers and disabled number for CS come 
from the 2007 Survey of Earned Doctor- 
ates, and the U.S. population numbers 
come from the 2000 U.S. Census. All 
numbers are rounded to the nearest 
full percent. 

As indicated in the table, there is 
underrepresentation at the Ph.D. level 
by women, African Americans, Na- 
tive Americans, Hispanics, and per- 
sons with disabilities. There has been 
a steady growth in the percentage of 
women earning Ph.D.s in computer 
science from the early 1970s from ap- 
proximately 10% to now just over 20%. 
By contrast, during the same period the 
percentage of women earning Ph.D.s in 
biology rose from less than 10% to now 
just over 50%. 


Broadening Participation Efforts 
The lack ofwomen in computer science 
at all levels has motivated a number 
of efforts including ACM’s Commit- 
tee on Women in Computing (ACM- 
W), Computer Research Association’s 
Committee on the Status of Women 
in Computing Research (CRA-W), the 
Anita Borg Foundation for Women 
and Technology (ABI), and the Nation- 
al Center for Women and Information 
Technology (NCWIT). These organiza- 
tions provide a number of programs 
at all levels to promote and support 
the inclusion of more women in the 
computing field. The Grace Hopper 
Celebration of Women in Computing 
brings women in the computing field 
together every year. 

The Coalition to Diversify Com- 
puting (CDC), which is sponsored by 
ACM, CRA, and IEEE, has the goal to 
increase the participation of under- 
represented minorities in computing. 
The CDC organizes the Tapia confer- 
ence every two years. 

The ACM Special Interest Group on 
Accessible Computing (SIGACCESS) 
has as one of its missions to “educate 
the public to support careers for dis- 
abled persons.” The AccessCom- 
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The computing field 
cannot continue 

to rely on getting 

the vast majority 

of its high-tech 
workers from a few 
demographic groups. 


puting Alliance (see http://www. 
washington.edu/accesscomputing) 
based at the University of Washing- 
ton is a primary resource for students 
with disabilities who are interested 
in computing and for computing | 
departments to make themselves 
more welcoming for such students. 
Nevertheless, there is no organiza- 
tion within the major computing 
organizations (ACM, CRA, and IEEE 
Computer Society) that advocates for | 
broadening participation for persons 
with disabilities. 

Since 1994, the CRA’s A. Nico Hab- 
ermann Award has been awarded to 
people who have “made outstanding 
contributions aimed at increasing the | 
numbers and/or successes of under- 
represented members in the comput- 
ing research community.” The Super- 
computer Conference (SC) is the only 
mainstream conference I know of that | 
includes a section on “Broader Engage- 
ment” that is “aimed at increasing the | 
involvement of individuals who have 
been traditionally underrepresented 
in the high-performance computing 
(HPC) field.” 


Broadening Participation at the 
National Science Foundation 

In 2006, the National Science Founda- | 
tion initiated a new program called 
Broadening Participation in Comput- | 
ing (BPC) within the Division of Com- | 
puter and Information Science and En- 
gineering (CISE). This program funds a 
small number of alliances and a larger 
number of demonstration projects. All 
the projects have the goal of trying to | 
increase the participation and success 
of underrepresented groups in comput- 
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| ing. Some alliances partner research 


universities with minority-serving in- 
stitutions to get more undergraduates 
involved in research. Some alliances are 
regional, taking advantage of locality 
to strengthen the alliance, while oth- 
ers are national, centered around one 
topic such as robotics or students with 
disabilities. Demonstration projects 
are more narrowly focused on develop- 
ing specific interventions to enthuse 


| and build the capacity of students from 


underrepresented groups in the com- 
puting field. A new initiative in BPC that 
started this year are the Leveraging, Scal- 
ing, or Adapting (LSA) projects where a 
project does not have to be a new inter- 
vention, but can be an existing interven- 
tion that has been proven for one group 
and could be applied to another. 


Conclusion 

I hope this premiere Broadening Par- 
ticipation has provided some _ back- 
ground while suggesting future col- 
umns. In this spirit, there have been 
some very encouraging developments 
over the past few years. Frances Allen 
(2006 recipient) and Barbara Liskov 
(2008 recipient) were the first women 


| to win the ACM Turing Award after the 


award had been given to 50 men since 
its inception. The inaugural ACM-Info- 
sys Foundation Award in the Comput- 
ing Sciences was awarded to a woman, 
Daphne Koller, in 2007. The Grace Hop- 
per and Tapia conferences are growing 
and very successful. Broadening par- 
ticipation is beginning to appear as 
a theme in mainstream conferences, 
and this column will contribute to call- 
ing attention to the importance of this 
evolving topic in computing. 
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Emerging Markets 
Israel’s Technology Industry as 
an Economic Growth Engine 


How government-industry collaboration can have far-reaching economic influences. 


VER THE PAST 40 years, Is- 

rael’s economy has trans- 

formed from being closed 

and poor into one that is 

open, developed, and driv- 
en forward by a high-tech sector that is 
well regarded around the world. Much 
of the credit for this can be attributed 
to successive Israeli governments for 
realizing that a civilian research and 
development industry could become 
an engine of economic growth, and 
then implementing policies to create 
it and ensure its expansion. However, 
the government’s actions vis-a-vis the 
high-tech sector—and by extension 
the wider Israeli economy—stand at 
a critical juncture. As strong as the 
industry is, concerns exist about its 
sustainability in the face of the global 
economic slowdown and a weakening 
of the structural components under- 
pinning its success. The government, 
as the achievements of its policies over 
the last 40 years demonstrate, and par- 
ticularly the previous 20, has the ability 
to help surmount these challenges and 
preserve Israel’s technological edge in 
order to ensure renewed progress in 
the decades to come. 


Policy Success 

Israel has implemented numerous 
initiatives and policies to leverage the 
technology sector as a vehicle for eco- 
nomic growth, beginning in 1969 with 
the establishment of the Office of the 
Chief Scientist (OCS), which has be- 


come the main instrument for foster- 


ing innovation and manages several 
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Microsoft Corporation’s Israel Research and Development Center was inaugurated in 2006. 


programs that offer grants and other 
assistance to high-tech companies. 
The state also provides tax breaks and 
additional benefits to attract overseas 
investment in technology firms and 
encourage foreign companies to estab- 
lish operations in Israel. 

The success of government R&D 
policy can be seen ina number of ways. 
On a quantitative basis, Saul Lach of 
the Hebrew University in Jerusalem 
has demonstrated that the return to 
the economy of state investment in 
high-tech ranges from 473% to more 
than 1,000%.* More broadly, the gov- 


a The research was conducted by the research 
firm Applied Economics (http://www.applied. 
co.il) with Saul Lach’s academic supervision. 
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ernment achieved its goal of building 
a robust high-tech sector. From a small 
base even as late as 1993, there are now 
approximately 4,000 high-tech compa- 
nies, one of the highest concentrations 
of such firms outside of Silicon Val- 
ley. These companies employ almost 
250,000 staff, while the segment’s share 
of business sector employment in 2007 
was around 9%. An indicator of the 
strength of the industry is the presence 
of multinational corporations such as 
Intel, IBM, Motorola, and Microsoft, 
which provide experience to their staff 
in managing on a wide scale, complet- 
ing big projects, marketing, and deal- 
ing with large customer accounts. 

The growth in the high-tech sector 
accelerated after 1993 following the 
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implementation of Yozma, a program | 


designed to create a venture capital 
industry and make large-scale start-up 
financing readily available for the first 
time. Since then, VC firms have raised 
$13 billion, mostly from overseas, and 
backed companies that have account- 
ed for $35 billion in exits. Most exits 
have been in the form of initial public 
offerings on exchanges abroad or sales 
to foreign acquirers, with the latter ac- 
celerating the increase in the presence 
of overseas companies in Israel. These 


successes have provided VCs with the | 


returns that have enabled them to raise 
further funds and invest in more high- 
tech businesses, thus ensuring the con- 
tinued development of the sector. 

In addition, the high-tech industry, 


as per the government’s aim, has be- | 


come an engine of economic growth. 
Between 1995 and 2007, while Israel’s 
gross domestic product increased two- 
and-a-half times to $164 billion, high- 
tech GDP more than tripled to approxi- 
mately $18 billion, or more than 10% 
of the total figure. From 1995-2008, 
high-tech exports jumped more than 
fourfold from $5 billion to $23 billion, 
with their contribution to total exports 
growing from 20% to 32% respectively. 
Furthermore, the expansion of the 
technology sector has helped create of 
tens of thousands of jobs in ancillary 
fields such as law firms and accoun- 
tancy practices. 


The Work Force 

There are many reasons why Israel's 
innovation industries have become an 
integral part of the economy. One fac- 
tor is the work force: per capita, Israel 
is among the leading countries in the 
world for the number of engineers, 
Ph.D.s, patents, and citizens with a 
tertiary education, as well as for the 
standard of its research institutions as 
ranked by the World Economic Forum. 
In addition, Israelis enter the work 
force at a higher average age than in 
other countries after having completed 
compulsory national service, where 
they develop problem-solving, leader- 
ship and teamwork skills, and take on 


significant responsibilities. These ele- | 


ments have fostered a culture of inno- 
vation that has led to the pioneering of 
widely used technology such as Voice 
over Internet Protocol (VoIP), Internet 
firewalls, voicemail, Intel’s Centrino 
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A major reason 

why there are so 
few big high-tech 
companies in Israel 
is that many are 
sold, often to foreign 
acquirers, before 
they become large. 


Wi-Fi chips, and the disk-on-key flash 
memory device. The caliber of the work 
force has also played an important role 
in drawing multinational corporations 
and international investment to Israel. 


Challenges 

However, the high-tech sector is vul- 
nerable despite its apparent strength. 
Among the greatest concerns is the 
global economic crisis, which has led 
to a reduction in the sources of credit 
and equity capital. The IPO market 
has stagnated, with not a single Israeli 


| company listing on a foreign exchange 


since 2007, a notable fact given that Is- 
rael has more firms traded on Nasdaq 
than any other country outside North 
America. With revenues falling, busi- 
nesses of all sizes and types are feeling 
the effects of the slowdown, leading to 
layoffs and company closures, and sty- 
mieing R&D projects and the establish- 
ment of new firms. 

Israel is particularly exposed to the 
slump because ofits reliance on exports 
and foreign capital, especially with the 
dollar weakening against the shekel. In 


to a failure to create large companies, 
ensure diversity and spread the ben- 


_ efits of Israel’s high-tech success to the 


wider economy and society. Moreover, 


| Israel’s tertiary education system is 


declining, threatening the basis of the 
country’s technological edge and eco- 
nomic growth—its work force. 


_ Addressing the Challenges 


As great as the challenges facing Is- 
rael are, there is much it can do to ne- 
gate the effects of these difficulties. A 
policy paper for the Israel Democracy 
Institute (IDI) formulated by a team 
chaired by Arnon Bentur of the Tech- 


| nion-Israel Institute of Technology, 


and comprising government econom- 
ic advisors, high-tech business lead- 
ers, leading academics, Chief Scientist 
Eli Opper and one of the writers of this 
column, Orna Berry, has called on the 
country’s ruling coalition to reinvigo- 
rate the government’s involvement 
in the high-tech sector with the aim 
of helping it overcome the economic 
crisis. Such involvement has become 
static in recent years, as pointed out 
in the report The Future of Growth Pro- 


| motion in Israel: A Return to Boosting 


Avant-Garde Industries and Scientific- 
Technological Innovation,” the OCS’s 
2009 budget is only two-thirds of what 
it was in 2000. While the authors of 
the paper understand the spending 
constraints on the government, Israel 
should do its utmost to reverse this 
trend as a strategic priority, especial- 
ly given that state investment in the 
high-tech sector has been shown to 
create wealth and generate returns for 
the economy. 

To help stave off job losses, for ex- 


_ ample, and ensure a continuation of 
_ the innovation that is so vital to Israel’s 


2008, sales abroad accounted for 40% | 


of GDP, while from 1999-2008, the di- 
rect share of overseas financing in to- 
tal VC investments was approximately 
60%, although the overall foreign par- 
ticipation was much greater because 
Israeli VCs raise the vast majority of 
their funds from abroad. Much of this 
capital goes toward early-stage compa- 
nies in the software, communications, 
networking, and semiconductors sec- 
tors and not enough to firms at a later 
stage of development or in other indus- 
tries. These trends have contributed 
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prosperity, the report recommends 
that the government increase its as- 
sistance to companies in reducing the 
costs of R&D projects. To lower the 
dependence on foreign capital, the 
authorities should encourage local in- 
stitutional investment in the high-tech 
sector, which is minimal, through the 
loosening of prohibitive regulation and 
via the provision of tax incentives. 
Another goal that the policy paper 
identifies is the need to encourage 


b See http://www.idi.org.il/sites/english/events/ 
TheAnnualEconomicForum/Documents/ 


the development of large companies, 
defined as employing over 450 work- 


ers and generating annual sales of | 


more than $100 million. Just 1% of 
high-tech businesses in Israel belong 
to this category, and since 1995, only 
four companies have been created 
that have remained independent and 
now satisfy the second criterion. This 
is significant, because the advantages 
of large companies to the economy are 
manifold. One of the most important 
advantages is that because they often 
locate some of their manufacturing 
in different regions in Israel and em- 
ploy workers withall types of back- 
grounds, they spread the benefits of 
the high-tech sector socially and geo- 
graphically. In addition, job creation 
at big corporations is higher than at 
small firms, with OCS figures demon- 
strating that a company experiences a 
substantial growth in the number of 
its employees—from dozens to hun- 
dreds—after it crosses the threshold 


of $100 million in sales a year. Large | 


firms are also more likely to be profit- 
able, contribute to the tax base, less 
likely to close, and have greater re- 
sources for development. 


A major reason why there are so few | 


big high-tech companies in Israel is 
that many are sold, often to foreign ac- 
quirers, before they become large. This 
is partly a consequence of VC financing 
being such a prominent source of cap- 
ital in Israel, and is the disadvantage 
of a model that has otherwise brought 
the country huge benefits. Further- 
more, the size of most Israeli VC 
funds means that late-stage financing 
is mostly beyond their scope, because 
much larger sums of money are need- 
ed than at earlier junctures. There is 
also an almost total lack of post-VC 
funding, and in particular, private- 
equity mezzanine capital. The result 
is that growth companies find it diffi- 
cult to raise the cash required to build 
the operational infrastructure—such 
as in finance, management, manufac- 
turing, sales, and marketing—that en- 
able them to expand. The government 
should fill in this gap by providing the 
necessary resources and reducing the 
risks involved, especially in regard to 
the building of large plants and other 
advanced projects, while simplifying 
the prohibitive bureaucracy associ- 
ated with these processes. 


This fostering of links between the 
private and tertiary sectors should also 
be widened as part of an overall strat- 
egy aimed at reversing a decline in the 
education system that threatens the 
quality of Israel’s future work force 
and hence the long-term status of its 
high-tech industry. Major problems 
include a relative decline in the num- 
ber of R&D university personnel at a 
time that this is rising in Europe and 
Asia, an aging faculty that is not being 
adequately replaced, high staff-student 
ratios, a “brain drain” to the U.S., anda 
fall in the proportion of undergradu- 
ates in science and engineering—sub- 
jects that are vital for entry into high- 
tech positions. 

Immediate action that Israel should 
take to offset these trends, in addition 
to that noted above, includes expand- 
ing the higher-education budget in 
order to recruit new faculty members 
and attract researchers back to Israel 
from abroad; increasing grants for 
doctorates and post-doctorate stud- 
ies; promoting a return to academic 
study among high-tech workers who 
have recently lost their jobs; and wid- 
ening the scope of scientific research 
funds. As with other areas of govern- 
| ment policy, a strengthening of the 

education system would help spread 
the benefits of the high-tech sector 
by facilitating the drawing in of more 
of those from Israel’s lower socioeco- 
nomic echelons. 

As has been demonstrated, Israel’s 
high-tech sector constitutes a major 
part of its economy, so despite a sur- 
feit of problems the government faces, 
it should give high priority to helping 
solve the difficulties the industry is 
experiencing. The government should 
not wait for this growth engine of the 
economy to splutter to a halt, especially 
with international competition for in- 
vestment increasing as countries such 

| as China, India, Finland, and South Ko- 
rea execute strategies to promote their 
own high-tech sectors. 
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Calendar 
of Events 


December 2009 


December 16-18 
International Conference 

on Frontiers of Information 
Technology, 

Abbottabad, Pakistan, 
Contact: Muhammad, Sarfraz, 
Email: prof.m.sarfrax@ 
gmail.com 


December 16-19 
International Conference on 


| High Performance Computing, 
| Kochi, India, 


Contact: Manish Parashar, 
Phone: 732-445-5388, 
Email: parashar@rutgers.edu 


December 16-19 
SIGGRAPH Asia 2009, 
Yokohama, Japan, 
Sponsored: SIGGRAPH, 
Contact: Masahiko Inakage, 
Phone: 81-467-32-7641, 
Email: inakage@ 
media-studio.co.jp 


January 2010 


January 3-7 
23" International Conference 


| on VLSI Design & 9" 


| International Conference on 


Embedded Systems, 
Bangalore, India, 

Contact: Srivaths Ravi, 
Email: srivaths.ravi@ti.com 


January 9-14 

ACM SIGPLAN Principles and 
Practice of Parallel Computing, 
Bangalore, India, 

Sponsored: SIGPLAN, 

Contact: David Padua, 

Phone: 217-333-4223, 

Email: dapadua@gmail.com 


January 18-22 

The Twelfth Australasian 
Computing Education 
Conference, 

Brisbane, Australia, 
Contact: Tony G Clear, 
Phone: 64-9-917-9999, 


Email: tony.clear@aut.ac.nz 


January 20-23 
International Conference 
on Biomedical Engineering 
Systems 

and Technologies, 
Valencia, Spain, 

Contact: Joaquim B. Filipe, 
Phone: 351-91-983-3996, 
Email: jfilipe@insticc.org 
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The Profession of IT 
Computing’s Paradigm 


Trying to categorize computing as engineering, science, or math is fruitless; 


we have our own paradigm. 


OMPUTING RIGHTFULLY COMES up 
in many discussions of uni- 
versity organization and cur- 


ricula, high school courses, | 
job qualifications, research | 


funding, innovation, public policy, and 
the future of education. In repeated at- 
tempts to characterize our field in these 
discussions, our leaders continue to en- 
counter sometimes contentious debate 
over whether computing is a field of en- 
gineering or science. Because it leaves 
others with a sense that we lack a clear 
focus, that debate negatively influences 
policies involving computing. 

There seems to be agreement that 
computing exemplifies engineering and 
science, and that neither engineering 
nor science characterizes computing. 
What then does characterize comput- 
ing? In this column, we will discuss 
computing’s unique paradigm and of- 
fer it as a way to leave the debilitating 
debate behind. 

The word “paradigm” for our pur- 


poses means a belief system and its as- | 


sociated practices, defining how a field 
sees the world and approaches the solu- 
tions of problems. This is the sense that 
Thomas Kuhn used in his famous book, 
The Structure of Scientific Revolutions. 
Paradigms can contain sub-paradigms: 
thus, engineering divides into electri- 
cal, mechanical, chemical, civil; science 


divides into physical, life, and social sci- | 


ences, which further divide into sepa- 
rate fields of science. 


Roots of the Debate 
Whether computing is engineering or 


itself. Some founders thought the new 
field a branch of science, others en- 
gineering. Because of the sheer chal- 
lenge of building reliable computers, 
networks, and complex software, the 
engineering view dominated for four 
decades. In the mid-1980s, the science 
view began to assert itself again with 
the computational science movement, 


which claimed computation as a new | 


sub-paradigm of science, and stimu- 
lated more experimental research in 
computing. 

Along the way, there were three waves 
of attempts to provide a unified view. 
The first wave was by Alan Perlis,° Allen 
Newell,’ and Herb Simon,'' who argued 
that computing was unique among all 
sciences and engineering in its study of 
information processes. Simon went so 
far as to call computing a science of the 
artificial. 

The second wave started in the late 
1960s. It focused on programming, 
seen as the art of designing information 
processes. Edsger Dijkstra and Donald 


science is a debate as old as the field | Knuth took strong stands favoring pro- 
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| gramming as the unifying theme. In 
recent times, this view has foundered 
_ because the field has expanded and the 
public understanding of programmer 
has become so narrow (a coder). 
The third wave was the NSF-spon- 
| sored Computer Science and Engineer- 
ing Research Study (COSERS), led by 
Bruce Arden in the mid-1970s. It defined 
computing as automation of informa- 
| tion processes in engineering, science, 
_ and business. It produced a wonderful 
| report that explained many exotic as- 
pects of computing to the layperson.! 
However, it did not succeed in reconcil- 
ing the engineering and science views 
of computing. 


| Peaceful Coexistence 
In the mid-1980s, the ACM Education 
Board was concerned about the lack of 
a common definition of the field. The 
Board chargeda task force to investigate; 
its response was a report Computing as 
a Discipline.’ The central argument of 
the report was that the computing field 
was a unique combination of the tradi- 
tional paradigms of math, science, and 
engineering (see Table 1). Although all 
three had made substantial contribu- 
| tions to the field, no single one told the 
| whole story. Programming—a practice 
that crossed all three paradigms—was 
essential but did not fully portray the 
depth and richness of the field. 

The report in effect argued for the 
| peaceful coexistence of the engineer- 
ing, science, and math paradigms. It 
found a strong core of knowledge that 
supports all three paradigms. It called 
on everyone to accept the three and not 


try to make one of them more important 
than the others. 

Around 1997, many of us began to 
think the popular label IT (information 
technology) would reconcile these three 


parts under a single umbrella unique | 


to computing.*’ Time has proved us 
wrong. IT now connotes technologi- 
cal infrastructure and its financial and 
commercial applications, but not the 
core technical aspects of computing. 


A Computing Paradigm 
There is something unsatisfying about 
thinking of computing as a “blend of 
three sub-paradigms.” What new para- 
digm does the blend produce? 

Recent thinking about this question 


has produced new insights that, taken | 


together, reveal a computing paradigm. 
A hallmark of this thinking has been 
to shift attention from computing ma- 
chines to information processes, in- 
cluding natural information processes 


Math 


Characterize objects 
of study (definition) 


Initiation 


Science 


Observe a possible 
recurrence or pattern of 
phenomena (hypothesis) 


_ databases, graphics, artificial intelli- 


such as DNA transcription.”® The great | 
principles framework interprets com- 
puting through the seven dimensions 
of computation, communication, co- 
ordination, recollection, automation, 
evaluation, and design (see http:// 
greatprinciples.org). The relationships 
framework interprets computing as a 
dynamic field of many “implementa- 
tion” and “influencing” interactions.’? | 
There is now a strong argument that 
computing is a fourth great domain of 
science alongside the physical, life, and 
social sciences.” 

These newer frameworks all rec- 
ognize that the computing field has 
expanded dramatically in the past 
decade. Computing is no longer just 
about algorithms, data structures, nu- 
merical methods, programming lan- 
guages, operating systems, networks, 


gence, and software engineering, as it 
was prior to 1989. It now also includes 


Engineering 


Create statements 
about desired system 
actions and responses 
(requirements) 


Conceptualization Hypothesize possible re- 


lationships among objects 


Construct a model that 
explains the observation 


Create formal statements 
of system functions and 


(theorem) and enables predictions _ interactions (specifica- | 
(model) tions) 
Realization Deduce which relation- Perform experiments and Design and implement | 
ships are true (proof) collect data (validate) prototypes (design) 
Evaluation Interpret results Interpret results Test the prototypes 
Action Act on results (apply) Act on results (predict) Act on results (build) 


Table 2. The computing paradigm. 


Computing 


Initiation 


Determine if the system to be built (or observed) can be | 
represented by information processes, either finite (terminating) 
or infinite (continuing interactive). 


Conceptualization 


Design (or discover) a computational model (for example, 
an algorithm or a set of computational agents) that generates 
the system's behaviors. 


Realization 


Implement designed processes in a medium capable of executing 
its instructions. Design simulations and models of discovered 
processes. Observe behaviors of information processes. 


Evaluation 


Test the implementation for logical correctness, consistency 
with hypotheses, performance constraints, and meeting original 
goals. Evolve the realization as needed. 


Action 
evaluation. 


a LE 


Put the results to action in the world. Monitor for continued 


viewpoints 


..tiCéSCCCtCs 
There is an 
interesting 

distinction between 
computational 
expressions and the 
normal language of 
engineering, science, 
and mathematics. 


exciting new subjects including Inter- 
net, Web science, mobile computing, 


| cyberspace protection, user interface 


design, and information visualization. 
The resulting commercial applications 
have spawned new research challenges 


| in social networking, endlessly evolving 


Table 1. Sub-paradigms embedded in computing. | 


computation, music, video, digital pho- 
tography, vision, massive multiplayer 
online games, user-generated content, 
and much more. 

The newer frameworks also recog- 
nize the growing use of the scientific 
(experimental) method to understand 
computations. Heuristic algorithms, 
distributed data, fused data, digital fo- 
rensics, distributed networks, social 
networks, and automated robotic sys- 
tems, to name a few, are often too com- 
plex for mathematical analysis but yield 


| to the scientific method. These scien- 


tific approaches reveal that discovery is 
as important as construction or design. 
Discovery and design are closely linked: 
the behavior of many large designed 
systems (such as the Web) is discovered 
by observation; we design simulations 
to imitate discovered information pro- 
cesses. Moreover, computing has devel- 
oped search tools that are helping make 
scientific discoveries in many fields. 
The newer frameworks also recog- 


| nize natural information processes in 
| many fields including sensing and cog- 


nition in living beings, thought process- 
es, social interactions, economics, DNA 
transcription, immune systems, and 
quantum systems. Computing concepts 


enable new discoveries and understand- 


ings of these natural processes. 
The central focus of the comput- 


| ing paradigm can be summarized as 


DECEMBER 2009 


VOL. 52 


NO. 12 COMMUNICATIONS OF THE ACM 29 


viewpoints 


ACM 
Journal on 
Computing and 
Cultural 
Heritage 


ournal on computing and cultural heritage 


ee?e¢¢?¢ 


JOCCH publishes papers 
significant and lasting value in 
all areas relating to the use of ICT 


of 


in support of Cultural Heritage, 
seeking to combine the best of 
computing science with real 
attention to any aspect of the 
cultural heritage sector. 


ee¢o¢¢ 


www.acm.org/jocch 
www.acm.org/subscribe 


Association for 
Computing Machinery 


30 COMMUNICATIONS OF THE ACM 


DECEMBER 2009 


information processes—natural or 
constructed processes that transform 
information. They can be discrete or 
continuous. 

Computing represents informa- 
tion processes as “expressions that do 
work.” An expression is a description of 
the steps of a process in the form of an 
(often large) accumulation of instruc- 
tions. Expressions can be artifacts, such 
as programs designed and created by 
people, or descriptions of natural occur- 
rences, such as DNA and DNA transcrip- 
tion in biology. Expressions are not only 
representational, they are generative: 
they create actions when interpreted 


_ (executed) by appropriate machines. 


Since expressions are not directly 
constrained by natural laws, we have 
evolved various methods that enable us 
to have confidence that the behaviors 


| generated do useful work and do not 


create unwanted side effects. Some of 
these methods rely on formal mathe- 
matics to prove that the actions generat- 
ed by an expression meet specifications. 
Many more rely on experiments to vali- 
date hypotheses about the behavior of 
actions and discover the limits of their 
reliable operation. 

Table 2 summarizes the computing 


| paradigm with this focus. While it con- 


tains echoes of engineering, science, 
and mathematics, it is distinctively dif- 
ferent because of its central focus on 
information processes.° It allows engi- 


| neering and science to be present to- 


gether without having to choose. 

There is an interesting distinction 
between computational expressions 
and the normal language of engineer- 
ing, science, and mathematics. Engi- 
neers, scientists, and mathematicians 


| endeavor to position themselves as out- 


side observers of the objects or systems 
they build or study. Outside observers 
are purely representational. Thus, tradi- 
tional blueprints, scientific models, and 
mathematical models are not execut- 
able. (However, when combined with 
computational systems, they give auto- 
matic fabricators, simulators of mod- 
els, and mathematical software librar- 
ies.) Computational expressions are not 
constrained to be outside the systems 
they represent. The possibility of self- 
reference makes for very powerful com- 
putational schemes based on recursive 
designs and executions, and also for 
very powerful limitations on comput- 
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| ing, such as the noncomputability of 


_ halting problems. Self-reference is com- 


mon in natural information processes; 
the cell, for example, contains its own 
blueprint. 

The interpretation “computational 
thinking” embeds nicely into this 
paradigm. The paradigm describes not 
only a way of thinking, but a system of 


| practice. 


Conclusion 
The distinctions discussed here offer 
a distinctive and coherent higher-level 


| description of what we do, permitting 


us to better understand and improve 
our work and better interact with peo- 
ple in other fields. The engineering- 
science debates present a confusing 
picture that adversely affects policies 
on innovation, science, and technology, 
the flow of funds into various fields for 
education and research, the public per- 
ception of computing, and the choices 
young people make about careers. 

We are well aware that the comput- 
ing paradigm statement needs to be 
discussed widely. We offer this as an 
opening statement in a very important 
and much needed discussion. 
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Broken Builds 


Frequent broken builds could be symptomatic 
of deeper problems within a development project. 


Dear KV, 

Is there anything more aggravating to 
programmers than fellow team mem- 
bers checking in code that breaks a 
build? I find myself constantly track- 
ing down minor mistakes in other 
people’s code simply because they 
didn’t check that their changes didn’t 
break the build. The worst part is when 
someone has broken the build and 
they get indignant about my pointing 
it out. Are there any better ways to pro- 
tect against these types of problems? 

Made to be Broken 


Dear Made, 

I know you, and everyone else, are ex- 
pecting me simply to rant about how 
you should cut off the tips of the pin- 
kies of the offending parties as a les- 
son to them and a warning to others 
about carelessness. While that might 
be satisfying, it’s illegal in most places 
and, I’m told, morally wrong. 

A frequently broken build is a 
symptom of a disease, but it is not the 
disease itself. It indicates problems in 
any of the following three areas: man- 
agement, infrastructure, or software 
architecture. 

Management is the area that most 
quickly comes to mind when there 
is a team- or project-wide problem. 
The belief of most of the workers on 
a project—those tasked with writing 
and verifying code and systems—is 


that project-wide problems need to | 


be solved by Mommy (aka the project 


lead or the manager). Unfortunately, | 


Mommy can remind people only so 
often to clean up their rooms, to tie 
their shoes, and not to check in bro- 
ken code. 

One of the best solutions to the 
problem of people not checking their 
code before they check it in is peer 
pressure. Anyone who checks in code 
without compiling it first ought to feel 
embarrassed by such a mistake, and 
if not, the other people around them 
should strongly encourage them to 


feel embarrassed. Shame, it turns out, | 
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is a strong motivator for avoiding an- 
tisocial behavior. Like many—or per- 
haps all—of KV’s suggestions, sham- 
ing can be taken too far, but I suggest 
you try it and see how it works. 

Depending on Mommy to tell off the 
misbehaving kids becomes tiresome 
both for you and the project manage- 
ment after a while. What you want to 
see is a good working culture develop, 
one in which people know that break- 
ing the build is like taking a nap in the 
middle of the break room; funny once, 
but usually unacceptable. 

Poor infrastructure can also lead 
to suffering with frequently broken 
builds. One thing that continues to 
amaze me is how computer hardware 
gets cheaper, and yet companies con- 
tinue to coast along without a nightly, 
or more frequent, build system. For 


_ the price of a single desktop comput- 


er and a few days of scripting, most 
teams can have a system that periodi- 
cally updates a test build of their code, 
builds it, and sends email to the team 
if the build fails. The amount of time 
saved by such a system is easily mea- 
surable: subtract 1 from the number 
of programmers on a team and multi- 
ply the resulting number by the num- 
ber of hours it usually takes to figure 
out who broke the build, find them, 
shame them, and have them fix the 
build. Now multiply that number by 
the average hourly wage of each per- 
son on the team, and you have an ap- 
proximate idea of how much time and 
money was wasted by not having peri- 
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If all the 
components of a 


| software system 


are too inter- 
dependent, then 


_a change to one 


can result in 
an injury to all. 


odic builds. I won’t get into periodic 
testing, which can save even more 
time and money, because if your build 
is always broken, you clearly have not 


| achieved a sufficient level of sophisti- 


cation to move on to nightly tests. 
Even though the broken code will 


| still get into the system, with a peri- 
| odic build system the offending per- 


son will find out fairly quickly that he 
or she broke the build and hopefully 


| will admit it in an email message (“I 
| broke the build, hang on a second)” 


and then repair the error. While this 
is still suboptimal, it is far better than 
what you had before. 

Sometimes it is the build system it- 
self that is the source of the problem. 
Many modern build systems depend 


_ heavily on caching derived objects, 


as well as the parallelization of the 
build process. While a parallel build 
process can provide you results more 


| quickly, it can often lead to build fail- 


ures that are false positives. Trying to 
build an object that requires another 
object to be created first, such as an 
automatically created include file, 
always leads to trouble. Maintaining 


| tate to use the word architecture since 


overuse of the term has led to the un- 
fortunate proliferation of the job title 
software architect, which is far too of- 
ten a misnomer. 

If all the components of a software 
system are too interdependent, thena 
change to one can result in an injury 
to all. A lack of sufficient modulariza- 
tion is often a problem when software 
ships, but it is definitely a problem 
when the software is being compiled. 
When a change to an include file in 
one area leads to the build breaking 
in another area, then your software is 
probably too heavily interlinked, and 
the team should look at breaking the 
pieces apart. Often such links come 
from careless reuse of some part of 
the system. Careless reuse is when you 
look at a large abstraction and think, 
“Oh, I really want this version of meth- 
od X,” where X is a small part of the 
overall abstraction, and then you wind 
up making your code depend not just 
on the small part you want, but on all 
of the parts that X is associated with. 
If you get to the point where you know 
it’s neither carelessness nor poor in- 
frastructure that is leading to frequent 
build failures, then it’s time to look at 
the software architecture. 

Now you know the three most basic 
ways to alleviate frequent build break- 
age: shaming your teammates, adding 
some basic infrastructure, and finally 
improving the software architecture. 
That ought to keep you out of jail, for 
now. 


KV 
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the list of dependencies by hand is an | 
error-prone, but often necessary, pro- | 


_ cess. If you are using a build system 


that depends on caching and uses par- 
allel builds, then your problems may 
lie here. 

Now we come to the final area that 
is the cause of build problems. The 
way in which a piece of software is put 


| together, frequently referred to as its 


architecture, often impacts not only 


| how the software performs when it 


runs, but also how it is built. I hesi- 
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Savas Parastatidis, Evelyne Viegas, and Tony Hey 


Viewpoint 


A “Smart” Cyberinfrastructure 
for Research 


A view of semantic computing and its role in research. 


HE WEB HAS emerged as the | 


largest distributed infor- 

mation repository on the 

planet. Human knowledge 

is captured on the Web in 
various digital forms: Web pages, news 
articles, blog posts, digitized books, 
scanned paintings, videos, podcasts, 
lyrics, speech transcripts, and so forth. 
Over the years, services have emerged 
to aggregate, index, and enable the 
rapid searching of all this digital data 
but the full meaning of that data may 
only be interpretable by humans. In 
the common case, machines are inca- 
pable of understanding or reasoning 
about the vast amounts of data avail- 


able on the Web. They are not able to | 


interpret or infer new information 
from the data and this has been a topic 
of active research interest for decades 
within the artificial intelligence com- 
munity. While the dream of artificial 
intelligence—machines capable of 
human-level reasoning and under- 
standing—may still not be within our 


grasp, we believe semantic technolo- | 


gies hold the promise that machines 
will be able to meaningfully process, 


combine, and infer information from 


the world’s data in the not-too-distant 
future (see Figure 1). 


The Web ecosystem of simple for- | 


mats and protocols is an example of 
how we can effectively manage, share, 
access, and represent large amounts 


of data. Companies like Microsoft and | 


Google are building large-scale servic- 
es (such as search and cloud services) 
leveraging the existing hardware and 


software infrastructures. Schema 
languages, XML, Entity Data Models, 
Microformats, RSS, Atom, RDF (see 
http://www.w3.org/RDF/), OWL (see 
http://www.w3.org/2007/OWL/), and 
other technologies are being used to 
| capture the information in data while 
machine learning, entity extraction, 
neural networks, clustering, and la- 
_ tent semantics are approaches to ex- 
| tracting information from that data 
and help reason about it. The field is 
an active area of research and experi- 


(see the sidebar “Semantic Comput- 
ing” vs. “Semantic Web”). 


Data Mesh 

At the center of our discussion is the 
concept of a “data mesh,” a term we 
use to refer to the various information 
and knowledge representation tech- 
niques/technologies that have been 
developed over the years (see Figure 
_ 2). In its simplest form, a data mesh 
_ looks like a directed graph in which 
the nodes represent data/informa- 
tion captured in well-known formats 


Data 


Current technol 
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mentation and is still rapidly evolving | 
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and the edges capture a relationship, 
characterized by a predicate and per- 
haps other information, between the 
linked data. For example, “Jane listens 
to Santana every day” is a relation- 
ship, in which “Jane” (the subject) and 
“Santana” (the object) are the nodes, 
“listens to” is the edge (the predicate), 


_ and “every day” is an attribute of the 


edge. Other tuples could add further 
information to the data mesh (for ex- 
ample, “Santana is an artist,” “Santa- 
na plays the guitar,” “Santana makes 
music,” “Jane met Santana in 1995” 
and so forth). Semantic Web’s RDF is 


| one, but not the only, technology that 


can be used to represent such graphs 
or knowledge bases. Indeed, Cyc,’ Se- 
mantic Networks, WordNet,’ Multi- 
Net® are examples of other such tech- 


_ nologies/approaches. Scaling to the 


same level as the Web still remains a 
challenge for these approaches. 

We believe there is an opportunity 
to involve users, who are now equally 
producers as they are consumers of 
information on the Web, and not 
just the very few experts in producing 


Knowledge 


Figure 1. Data, information, knowledge: While we are good at data management at scale (for 
example, Google, Amazon) we are still far away from supporting information representation 
and reasoning. Knowledge management at scale is a great opportunity for innovation. 
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Figure 2. All data/information can be linked in a mesh through relationships. Common, 
machine-processable formats are used to represent every aspect of a data mesh. 


structured data at Web scale. Recent 
success stories in the application of 
knowledge representation to specific 
domains, as in the ™Grid (see http:// 
www.mygrid.org.uk/) work in BioIn- 
formatics research,'° demonstrate the 
potential benefits of semantic com- 


puting technologies. Here, we use the | 
term “data mesh” to encompass the | 


various concepts and approaches that 
could be used or combined to sup- 


port a semantics-rich ecosystem of re- | 


search tools and services. It is not our 
intention to suggest there would be 
one single data mesh that would rep- 
resent all human knowledge. 

We expect a great number of vocab- 
ularies to emerge, many of which will 
overlap, for representing every aspect 
of a data mesh (such as geo-location, 
mood, reviews, personal information, 
domain-specific concepts and terms). 
Ontologies will support an evolving 
ecosystem of facts, vocabularies, and 
relationships in specific domains. 
We are already witnessing a plethora 
of emerging efforts to standardize on 
such vocabularies, such as microfor- 
mats (http://www.microformats.org/), 
data portability (http://www.dataport- 
ability.org/), gene ontology (http:// 
www.geneontology.org/), and others. 

Programs will consume, combine, 
and correlate everything in the uni- 
verse of structured information and 
help users reason over it. They will al- 
low them to ask questions against this 


as “Which is the most popular book 
among my friends today?,” “Who is 


the expert on aspect A of my business | 
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workflow inside my organization?,” 
“Have Evelyne and Savas been at the 


same conference, at the same time in | 
any point in time?,” “What’s the de- | 


gree of separation in terms of citations 
between my paper and the seminal 
work by Jim Gray?” and so on. 

While data mesh instances can 
be built in isolation (as in many of 
today’s social networks), we believe 
the potential value of aggregating all 
of them and combining them in one 
huge network of facts is tremendous. 
This idea is similar to Tim Berners- 
Lee’s more recent rhetoric around 


the ‘Giant Global Graph of Facts’ (see | 


http://dig.csail.mit.edu/breadcrumbs/ 
node/215). Please note that we are not 
suggesting there would be a single re- 
pository of facts or that there would 


| even be universal agreement on what 


is represented. We do expect, however, 
to see machine-based technologies 


Lea: 
We are already 
witnessing the 
emergence of data 
mesh instances 

on the Web, 
especially as 

they relate to 


| social networks. 


(global) collection of facts—informa- | 
tion access policies permitting—such | 
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that would be able to reason, many 


_ times using probabilistic-based tech- 


niques, over the diverse set of facts. 
We are already witnessing the emer- 
gence of data mesh instances on the 
Web, especially as they relate to social 
networks. The Zune Social (http://so- 
cial.zune.net/) is an example of how a 


| social network can be combined with 


information about music preferences, 
recommendations, and an online mar- 


| ketplace. Facebook (http://www.face- 


book.com/) is another example of how 
connections between identities can 
help in aggregating user-oriented pref- 


| erences and then inferring behavior 


and preference statistics. Finally, Pow- 


| erset (http://www.powerset.com/) is an 


example of a search service that lever- 
ages existing structured information, 
for example, Freebase (http://www. 
freebase.com/) or generates it from un- 
structured sources (such as by applying 
natural language processing technolo- 
gies on Wikipedia content) to improve 
the quality of the query results. 

We believe that over time, a huge 
ecosystem of services and tools will 
emerge around data mesh instances. 
Such tools and services will allow us to 
move beyond current practice of infor- 


mation management by incorporating 


more automation. Recommendation 
engines will be the norm and our in- 
teractions with computers will always 
be context-aware (for example, “since 
the topic of the paper being written is 
about botany, a query about ‘bush’ is 
unlikely to be about a person’s name” 
or “the search about papers on orchid 


| will take into consideration the opin- 


ion of people in the user’s professional 
social network”). While today we can 
search for information over the global 
graph of linked Web pages consisting 
of predominately unstructured data, 
in the future we will be able to search 
over all types of semantically enriched 
information, which will in turn enable 
a wide range of new applications to 
emerge such as recommendation ser- 
vices, information management auto- 
mation, information inferencing, and 
so forth. 


Tools and Services to 

Support Research 

We believe the research community 
will play a central role in supporting 
and further evolving the semantic 


computing vision. We should not only | 
become early adopters of semantic | 
computing technologies and _ infra- 
structure in our research projects but 
we should also actively develop and 
evolve them. In Microsoft Research we 
are taking some first steps toward this 
vision, as we are investing in projects 
that can demonstrate the benefits of 
semantic computing technologies in 
research. We are therefore attempt- 
ing to build an ecosystem of research 
tools and services as demonstrations 
of these ideas and concepts. 

We focus here on the role of the re- 
searcher as an “extreme information 
worker” meaning a technology user 
with expectations and requirements 
at a scale not yet required by the busi- 
ness community. We believe informa- 
tion representation, management, 
and processing tools in combination 
with automation technologies will 
greatly help them in their research. | 
We are therefore taking small steps 
toward developing semantics-aware 
tools and services. Here, we describe 
some of the work we are doing in sup- 
porting the scholarly communications 
life cycle through semantic computing 
technologies. 

Semantic Annotations and Metadata 
in Word. The authoring stage is perhaps 


Domain-specific ontology 


_ in the scientific world. In many disci- 


Support for annotations 


Natural language 
may not always | 
be adequate 

to convey the 
meaning of a word 
or an expression, | 
especially in the 
scientific world. 


the best time to capture an author’s 
intentions and to record the meaning 
of the words as they are being writ- 
ten. Natural language may not always | 
be adequate to convey the meaning 
of a word or an expression, especially 


plines domain-specific ontologies are 
therefore being created by experts to 
address this issue but they have not so 
far been incorporated with productiv- 
ity tools like Microsoft Office. 

In collaboration with Phil Bourne 
and Lynn Fink at the University of 
California, San Diego, we worked to- 


viewpoints 


| ward a plug-in for Word 2007 (part of 


the BioLit project; http://biolit.ucsd. 
edu/) that allows authors to annotate 
words or sentences with terms from 


| an ontology (for example, Gene Ontol- 


ogy; _ http://www.geneontology.org/). 
The annotations are stored as part of 
the Office Open XML (OOXML) repre- 
sentation of the document (OOXML 
has been accepted as an ISO standard. 
More information can be found at 
http://openxmldeveloper.org/). Tools 


and services can now extract the an- 


notations by just opening the OOXML 
package without human intervention 
and there is not even a need for Word 
to be installed. As a result, the docu- 
ments will be able to be better catego- 
rized, indexed, and searched with the 
author’s intent always closely associ- 
ated with the text. 

The ability to easily annotate terms 
from within Word is a first step in pro- 
ducing documents that semantically 
relate to the body of knowledge in a 
domain. In this way, information can 
easily become part of a data mesh as it 
is being generated (see Figure 3). The 
source code for the plugin is now avail- 
able as open source (see http://ucsdbi- 
olit.codeplex.com/) for the community 
to further extend or just use as the ba- 
sis for a new generation of semantics- 


_ oriented authoring tools. 


Annotations travel with the 
document 


Can be used to improve 
domain-specific discovery of 
information, cross-linking, 


straight from within Word 


Figure 3. Semantic annotations in Word. 


Chemistry in Word. We are inves- 


_ tigating, in collaboration with Peter 
| Murray-Rust, Jim Downing, and Joe 


Townsend from the University of Cam- 
bridge, the introduction of chemistry 
drawing functionality into Word docu- 
ments (see http://research.microsoft. 
com/en-us/projects/chem4word/). 

Rather than just having images of 
chemical structures, we would like to 
preserve the chemistry-related seman- 
tics in a machine-processable man- 


ner. For that reason, we are using the 
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Figure 4. A simple “Chemistry Zone” in a Word document and the CML representation 


(in pseudo-XML) stored inside the OOXML document. 
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Chemistry Markup Language (CML) in 
our investigations; instances of CML 
would be embedded inside OOXML 
documents. We believe an ecosystem 
of chemistry-related tools and services 
can then emerge to enable the auto- 
matic processing of documents, mak- 
ing the authoring process an easy but 
increasingly valuable part of the re- 


_ search life cycle. 


As an example, consider the water 
molecule (H,O). In a Word document, 


| it appears as a series of characters, 
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one of which is a subscript. Through 
this project, it will be possible to also 
store the structured representation of 
water so that programs can discover 
it. Figure 4 shows how some part of a 
document can be identified as chemis- 
try. The tool will automatically save the 
CML representation of the identified 
region (1D and 2D representations 
and authoring functionality will also 
be supported; see Figure 5). The use of 
a semantically rich data format to rep- 
resent domain-specific information is 
another step in producing structured 


data that automatically becomes part | 


of the data mesh. 

Zentity—A Repository Platform. The 
need for quality, well-engineered, and 
documented software infrastructures 
to support institutional repositories, 
archives, and digital libraries is in- 
creasing, especially in the context of 


the global initiative toward Open Ac- | 


cess.*? We have developed a platform, 
called Zentity, to support repository 
systems based on_ product-quality 
technologies like SQL Server, .NET 3.5, 
and the Entity Framework (see http:// 
research.microsoft.com/en-us/proj- 
ects/zentity/). 

The Zentity platform supports a 
graph-based representation of the 
data in a repository. It provides an 
easy-to-use application programming 
interface that abstracts the use of the 
underlying relational system to man- 
age digital resources and the rela- 
tionships between them and creates 
a data mesh. Initially, the platform 
will be targeted toward the “research 
output repository” domain, offering 
a data model capable of capturing 
the research-related resources (for 
example, papers, reports, theses, pre- 
sentations, and data sets) of an orga- 
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Figure 5. A more complicated chemical structure from an early prototype of our tool. 


nization. However, Zentity has been 
designed to support the data models 
of arbitrary domains (for example, 
museums, art collections, research 
data, and so forth). 

Interoperability is a major focus of 
the project and we are implementing 
support for popular Semantic Web 
technologies, like RDF and RDFS. We 
are also building a number of tools 
and services to operate against the 
data mesh created and we hope that 
more will be developed by the com- 
munity as we make the platform freely 
available. 

In addition to supporting the re- 
pository community through prod- 
uct-quality technologies, our work on 
Zentity attempts to demonstrate some 
of the principles of data meshes. The 
data model employed for the imple- 


es ae ee ei eee eS ca Ck am ene 
“Semantic Computing” 
vs. “Semantic Web” 


We make a distinction between the general approach of computing based on 
semantic technologies (machine learning, neural networks, ontologies, inference, 
and so forth) and the “Semantic Web” as described in Berners-Lee et al.’ and 
Shadbolt et al.’ and which is the term used to refer to a specific ecosystem of 
technologies, like RDF and OWL. The Semantic Web has gained a lot of attention 
lately, bringing more awareness of the importance of semantics. However, we 
consider the Semantic Web technologies to be just some of the many tools at our 
disposal when building semantics-based solutions. 
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mentation promotes the graph repre- 
sentation principles discussed earlier 
and illustrated in Figure 2. 

Social Networking and Data Meshes. 
As a final example, we examine the 
relationship between social networks 
and other structured information. We 
consider the former a special case of 
a data mesh (in which the nodes are 
people and the edges between them 
represent human relationships, such 

s “friend” or “colleague”). A social 
network can provide context for the 
interactions between people (for ex- 
ample, “the botany domain-specific 
social network”); it can be used to infer 
information about a community (for 
example, “the botany community has 
been actively looking at orchids over 
the last week”); it can be used to pro- 
vide recommendations (for example, 
“the most read article about orchids 
can be found at location X”); it can be 
used to change or supplement the way 
peer reviewing is done and merit is giv- 
en (for example, “the article posted on- 
line on orchids last week has received 
great reviews from the experts of the 
botany community”). 

One can therefore imagine an 
ecosystem of tools and services that 
takes advantage of the relationships 
between researchers in the context of 
a particular research discipline, col- 
laboration or research project and 
their activities, documents, and opin- 


ions. Typical research-related pro- 
cesses could be augmented or even 
completely supplanted. For example, 
researchers could automatically get 
recommendations of papers and con- 
tacts based on what they are currently 


doing; experts might be automatically | 


identified in a domain based on dis- 
cussions around their papers and blog 
entries; peer reviewing could evolve to 
take into consideration the new social 
media and Web-based interactions; 
and even ‘impact factors’ for institu- 
tions might incorporate electronic 
analysis of all types of information 
and not just citations to publications 
and research grants. 

Our support of the "YExperiment 
(http://www.myexperiment.org/) proj- 
ectisa demonstration of our belief that 
scientific collaboration and informa- 
tion sharing can be supported through 
social networking. The "YExperiment 
project brings together social net- 
works and workflows in a single infor- 


mation graph—a data mesh—that can | 


be browsed, analyzed, and searched. 


Conclusion 
As researchers and scientific instru- 
ments can now produce and publish 


large amounts of data and informa- | 


tion more easily than at any other 
point in history, there is an increasing 
requirement for automation tools to 
help manage and navigate the deluge 
of research data. For example, projects 
like Pan-STARRS (http://pan-starrs.ifa. 
hawaii.edu/) and the HLC (http://Ihe. 
web.cern.ch/) will generate many peta- 


bytes of data. The emergence of folk- | 


sonomies on the Web is one example 
of how user-driven categorization can 
help with information discovery. The 


rr 
There is an increasing 
requirement for 
automation tools 

to help manage and 
navigate the deluge 

of research data. 


We need to 

invest significant 
resources to 

making the semantic 
computing vision 

a reality. 


need to deal with meaningful and rel- 


evant information within the context 
of one’s actions is growing. There is an 
immense opportunity for the research 
community to bring its expertise and 
experience together in accelerating 


the development of semantic comput- © 


ing technologies. We need to invest 
significant resources to making the se- 
mantic computing vision a reality by: 

> investing in semantics-aware in- 
frastructure; 


> increasing awareness of the po- | 


tential of semantics-based computing; 
and 
> training more researchers on se- 


mantics-based computing and related | 


technologies. 

The discussion on data meshes 
shows the potential value of aggregat- 
ing information in a(semi-)structured, 
machine-interpretable manner. We 


cloud services, and data formats will 
emerge to support “information and 
knowledge management,” namely, the 
(automatic) acquisition, representa- 
tion, aggregation, indexing, discovery, 
consumption, correlation, manage- 
ment, and inference of information. 
Doing so at scale would significantly 
improve the way we discover and share 
information and how we collaborate. 
We have described a representa- 
tive set of investments we are making 


| to ease the transition of researchers 


toward a world where information is 
produced and consumed in a struc- 
tured and semantics-rich manner 
(more information about the work and 


research tools offered by Microsoft Re- | 


search for scientists can be found at 
http://research.microsoft.com/en-us/ 


collaboration/about/). However, this | 


| will not happen instantly. There is a lot 
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viewpoints 


of unstructured data out there already. 
Data-mining technologies are neces- 
sary to automatically extract as much 
semantically rich information as pos- 
sible. For example, Microsoft’s Live 
Labs has worked on machine learn- 
ing-based technologies to extract en- 
tities from the unstructured Web (see 
http://livelabs.com/projects/entity- 

extraction/). The research world needs 


| similar technologies to be deployed at 


scale that can aggregate, index, and 
mine research-related information. 
We believe such an ecosystem of 
semantics-aware tools and_ services 
will ultimately become the norm in our 
day-to-day interactions with comput- 


_ ers, constituting a global “smart cy- 
_ berinfrastructure.” However, if the big 
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| companies are to invest in implement- 


ing these ideas and technologies in 
their offerings (products and services), 
the research community must test and 
demonstrate their potential as part of 
the community’s attempt to build a 
smart cyberinfrastructure for research. 


| Ultimately, this vision of a data mesh 


and smart cyberinfrastructure will go 
some way toward realizing the visions 
of the early pioneers like Vannevar 
Bushé and J.C.R. Licklider.® 
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A Threat 
Analysis 
of RFID 
Passports 


the bank statements, you suddenly 
realize that someone has been chare- 
ing large amounts of money to an 
account in your name from a credit 
card company you’ve never heard of. 
You've lost thousands of dollars, and 
suddenly you aren’t feeling quite so 
| relaxed anymore. 

How could someone have been 
stealing money from you like this while 
you were away on vacation? The thiev- 
ery actually began months before you 

| even left home. Several months ago, 
as you were casually walking through 
the airport en route to a business meet- 
ing in Europe, someone was linger- 
ing close behind. As you approached 
a security agent to have your passport 
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checked, this individual used a small 
antenna connected to a computer in 
his backpack to eavesdrop on the radio 
communication between the security 
agent’s reader, which has the capac- 
ity to decrypt the highly sensitive and 
secured data on the passport, and the 
RFID-enabled passportitself. 

If the attacker had tried to skim 
the information off your passport | 
imitating a legitimate reader, the chip 
would never have provided the person- 
al data within, as the correct 
would not have been given. Since the 
attacker was merely intercepting the 
commu tion with an antenna, how- 
ever, he was able to collect all of the 
data, albeit in an encoded form. Private 


cess key | 


information, including not only basic 
information about your identity but 
even a digitized photograph, had been 
stolen from you at a moment when you 
thought your passport was safely in the 
hands of a government official. You 
moved on without any clue as to how 
deeply your priv had been violated 
i that you had no idea was 
urring. 

At that point, all the perpetrator 
needed to do was use the data to cre- 
ate a new pa rt, use that passport 
to get a U.S. Social Security number 
(http://www.ssa.gov/pubs/10002.htm]), 
and then create credit card accounts in 
your name, with your identity, and run 
amok with your fi 


DECEMBER 2 


An RFID-passport attack of this 
nature is more plausible than other 
methods, such as skimming the RFID 
information. Although simple to do, 
skimming will not yield the informa- 
tion needed to enable identity theft be- 
cause of preventive measures integrat- 
ed into the system. The first of these 
measures i ption. According to 
the U.S. Department of State: “When a 
reader attempts to scan the passport, it 
engages in a challenge-response proto- 
col that proves knowledge of the pair 
of keys and derives a session key. If au- 
thentication i 
releases its data contents; otherwise, 
the reader is deemed unauthorized and 
the passport refuses read access.” 
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Additionally, newer passport cov- | this as a series of hurdles that the per- 


ers are being lined with materials that 
block RFID signals from being trans- 
mitted when the passport is closed, 
exposing the document to attack only 


petrator must overcome, starting with 


data capture, moving onto data recov- 
_ ery, and finally data reproduction. 


Let us first focus on capturing the in- 


when it is opened and displayed for a Six pieces of 
FeO pay er nl information can 
signal-blocking sleeves (http://www. 
rfid-shield.com/products.php) are also | be stolen from 
available for RFID passports. | the RFID chip on 
What Information is Compromised? | a U.S. passport: 
Six pieces of information can be stolen your name, 


from the RFID chip on a U.S. passport: a ei 
nationality, gender, 


formation from your passport, since it 
is at that point in the event chain that 
the vulnerabilities of the RFID technol- 
ogy are exploited. For successful data 
retrieval the perpetrator’s antenna 
must catch two different interactions: 
the forward channel, which is the signal 
being sent from the RFID reader to the 
RFID token; and the backward channel, 
which is the data being sent back from 
the RFID token to the RFID reader. Lab 
demonstrations* have shown that a 
| successful eavesdrop (a capture of both 
channels) on an RFID tag can occur at 
a distance of one meter with the use of 
an H-field antenna, a radio frequency 


your name, nationality, gender, date 


of birth, place of birth, and a digitized | H 
photograph.' Numerous problems of | date of birth, 
identity theft could arise from some- | place of birth, 
one taking that information, but this and a digitized 
photograph. 


article focuses on the financial risk. 
Banks in the U.S. require that ap- 


plicants for credit cards submit their 
Social Security numbers to be used for 
background credit checks. Although 
the passport RFID tag does not carry 
your Social Security number, a perpe- 
trator can use the information it does 
contain to obtain your number. 

The Social Security Administra- 
tion’s Web site (http://www.ssa.gov/ 
pubs/10002.html) requires one of three 
proofs of identity for a U.S. citizen to 
be issued a new Social Security card: a 
driver’s license, state-issued non-driver 
identity card, or passport. With the data 
stolen from your passport’s RFID chip, 
someone could create a copy of the 
passport, then use this counterfeit one 
to access a real copy of your Social Secu- 


rity card. With this card, the perpetra- | 
tor is free to apply fora real copy of your | 


credit card, not to mention opening new 
accounts in your name. This puts you 
at a serious financial risk, all because 
someone was able to eavesdrop on your 
passport’s RFID communication. 


Technology Requirements 

To eavesdrop on your passport infor- 
mation, a perpetrator needs hardware 
to capture the signal as it is being 
scanned by a legitimate RFID reader, 
such as those used by government of- 
ficials at airports. He or she would then 
need the time and technical capacity to 
decrypt the signal into a usable form. 
Finally, to reap any real benefits from 
the stolen information, the attacker 
must have all the materials necessary 
to reproduce a passport. We can view 
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receiver, an oscilloscope to monitor the 
signals, and a computer to store, ana- 
lyze, and manipulate the data. 

In the lab this was done as a proof of 
concept, but in the real world a perpe- 
trator could use smaller, more discrete 
hardware. In our airport scenario, the 
perpetrator would need only an an- 
tenna and an amplifier to boost the sig- 
nal capture, a radio-frequency mixer 
and filter, and a computer to store the 


| data. The amplifier itself would not 


even need to be that powerful, since 
it would need to boost the signal over 


| only a short distance of three to five 


meters. The antenna, mixer, and filter 
can be homemade with cheap materi- 


als or purchased as a set online. Some 


Web sites (for example, http://www. 
openpcd.org/openpicc.0.html) con- 
tain schematics, lists of materials, and 
steps on how to build your own RFID 
reader the size of a matchbox. These 
RFID “sniffers” can then be plugged 
into a laptop via a USB port. 

Once the perpetrator has success- 
fully eavesdropped on the communica- 
tion between the RFID token and the 
RFID reader, the next step is data recov- 
ery. This requires two separate steps. 
The first is recovering the actual signal 
between the RFID chip in the passport 
and the RFID reader. This is a signal- 
processing problem, essentially sepa- 
rating the actual signal from the noise 
of the background. Proof-of-concept 
experiments* have shown that data 
recovery is a brute-force problem that 
can be solved with current hardware. A 


ALEX GRIMM/GETTY 


PHOTOGRAPH BY 


perpetrator would need only to record 
the data passed between the RFID and 
receiver on location, and then could 
perform the time-consuming signal- 
processing operations at home. A large 
part of data recovery is extracting the 
data from the electrical noise of the en- 
vironment, which is simplified by tak- 
ing a noise profile of the environment. 


The same Web sites that provide sche- | 


matics for readers also provide code 
for decoding the data, although the ef- 
fectiveness of their programs on new 
passports has yet to be tested. 

Once the signal has been recov- 
ered, it must be interpreted as data. 
The difficulty of this step depends 
entirely on whether and how well the 
data is encrypted. The encryption key 
is generated from information on the 
passport—specifically, the name, date 
of birth, and passport number. There 
are reports that this key can be easily 
cracked (for example, http://www.mo- 
bilemag.com/2006/02/03/global-rfid- 
passport-encryption-standard-cracked- 
in-2-hours/) because the algorithm 


used to produce the key is predictable. | 


An analysis published by the Interna- 
tional Association of Cryptologic Re- 
search indicates that the entropy of the 
resulting key is on the order of 52 bits, 
which, while something ofa challenge, 


to data from the person's e-passport. 


is not impossible to crack." We assume 
here that decryption is practical; if it 
is not, then the possibility of these at- 
tacks is minimized. 

After recovering the data, the per- 


petrator would have everything neces- | 


sary to make a new passport with the 
captured information. The steps re- 
quired for this are beyond the scope of 
this article, but since counterfeiting of 
passports has been demonstrated and 
documented, it is enough to say that 
this is feasible. 


Costs to the Perpetrator 


| What we have shown so far is that with 


the right equipment and skill, a perpe- 
trator can intercept the signal between 
a passport and RFID reader, then forge 
the passport to use for identity theft. 
The more important question, howev- 
er, is whether the cost of doing this can 
be justified by the return. 

This question is predicated on the 
assumption that the encryption of the 
information held in the passport’s 
RFID tag can be broken. While there 
is some evidence this has been true 
in the past, stronger encryption could 
increase the cost of the attack consid- 
erably, to the point of making it either 


| economically unattractive or techni- 
' cally impossible. 
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EasyPass, a new automated border control system at Frankfurt International Airport, scans passenger biometric data and compares it 
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In our airport scenario, a perpetra- 
tor would have to cover several costs 
before reaching the ultimate goal of 
financial gain. To begin with, there are 
the hardware costs. The combined cost 
of the antenna, amplifier, radio mix- 
er, filter, USB connection, and laptop 
would be on the order of $1,000. These 
are all fixed costs, and the perpetrator 
would presumably amortize these by 
using the hardware to execute numer- 
ous attacks over a period of time. 

There is also cost associated with 
access to the passport reader. It is rea- 
sonable to assume that the perpetra- 
tor would have to purchase an airline 
ticket to enter the area where passports 
are scanned. 

The cost of being caught must be 
factored in. Compared with other tech- 
nologically intensive (for example, on- 
line) fraudulent attacks, theft of pass- 
port RFID data might involve greater 
risk because of the physical proxim- 
ity required to eavesdrop on the RFID 
communication. The risk-adjusted 
cost of being caught is quite significant 
when you consider the prevalence of 
security officers within airports and 
the severity of the crime. 

Presuming that the attacker man- 
ages to escape with the raw data from 
an eavesdropping operation, it still 
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has to be interpreted at home. The 


software costs are negligible (open | 


source code for this specific function 
is available on the Internet) as are the 
costs of the processing time. In one 
example, it took less than an hour to 
recover the passport signal, and this 
process can be automated.’ Although 
we have not verified this (since verifi- 
cation would require snooping a pass- 
port in a noisy environment such as 
an airport), the approach presented 
seemed plausible. 

Jeroen van Beek of the University of 
Amsterdam managed to forge a pass- 
port RFID chip for $120.’ This cost is 
not always necessary because a U.S. 
passport remains valid even if it is not 
fitted with an RFID chip or if the chip 
has failed. (Since all passports issued 
after 2007 have an embedded RFID 
chip and are valid fora maximum of 10 
years, the ability to use a passport with- 
out such a chip will end after 2017.) 
Rather, the most significant cost is 
in obtaining or producing a realistic- 
looking passport in which to print the 
information. The cost of a blank pass- 
port book is difficult to determine, but 
there are some indications that it is 
not an insubstantial part of the cost of 
this form of identity theft. In 2008, for 
example, 3,000 blank U.K. passports 
were stolen, and officials valued each 
one at approximately $3,000. 

Estimating the revenues that could 
be generated also requires some in- 
ference. In the U.S., the mean fraud 


amount per victim for identity theft-re- | 


lated crimes in 2008 was $4,849.” The 
potential revenue from the passport 
identity theft example, however, could 
conceivably be higher because of the 


relative ease with which a passport | 


can be used to open new accounts and 
prove identity, in comparison with the 
most common current forms of fraud 
using stolen credit cards, checks, or 
mail. Nevertheless, comparing this fig- 
ure to the $3,000 cost of a blank pass- 
port (which is just one of the many 
costs of creating a fake passport) re- 
veals that the operation may not be as 
profitable as one might have thought. 


Countermeasures 

A number of countermeasures have 
been suggested to protect against 
RFID privacy risks (not specific to the 
passport example), including perma- 
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nent tag deactivation (“killing”), tem- 
porary tag deactivation (such as using 
Faraday cages or sleep/wake modes), 
and access-control mechanisms (hash 
locks, pseudonyms, blocker tags). You 
could “kill” the RFID tag (hitting the 


_ chip with a hammer does the trick), 
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since, according to the State Depart- 
ment’s Web site, if the chip fails, the 
passport remains valid; however, most 
“killing” methods leave evidence of 
intentional damage. The other solu- 
tions would not prevent the intercep- 
tion of communications between tag 
and authorized reader, particularly at 
an airport. 

More effective countermeasures re- 
quire changes to current government 
policy. The government can take steps 
to improve the security and privacy of 
passports. The basic access-control 
system of a U.S passport encrypts 
communication between it and the 
RFID reader with a key generated from 
information written on the passport; 
the key containing the holder’s infor- 
mation is susceptible to brute-force 
attacks, however, since it has low en- 
tropy.’ One countermeasure would be 
to add a 128-bit secret, printed on the 


passport and unique to each passport, | 


to the key derivation algorithm. 

The interception of communica- 
tions between RFID tag and reader is 
possible because no material capable 
of blocking RF signals surrounds the 
passport-control area. Thus, another 


countermeasure would be to install | 


an enclosure to block RFID transmis- 
sion outside of the immediate area. In- 
creased security around the passport- 
control area could also minimize the 
possibility of intrusion on the commu- 
nication between tag and reader. 


The Final Analysis 

Having looked at the potential attack, 
the costs of that attack, and the re- 
turns, we can now ask how concerned 
we should be about such an exploit. 
Should you really be worried as you 
walk through the airport that someone 
behind you might be stripping you of 
your passport information in a grand 
scheme to rob you? 

The technical hurdles are surmount- 
able, at least in proof-of-concept dem- 
onstrations. It is possible that such an 
attack could occur, but this possibility 
must be balanced against the complex- 
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ity of the attack, the difficulty of ob- 
taining the required high-priced blank 
passport, and the limited return the at- 
tack is likely to produce. 

It seems much more likely that most 
perpetrators would resort to old-fash- 
ioned means of stealing your passport 
information, by stealing your physical 
passport itself. We recommend that it 
is more important to be careful about 
keeping your physical passport safely 
in hand than to be wary of perpetrators 
lurking behind you in line at the air- 
port attempting to exploit the RFID tag 
in your passport. 
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DNS is many things to many people— 
perhaps too many things to too many people. 


BY PAUL VIXIE 


uted, auto 


To underline our understanding of 
what DNS is, we must differentiate it 
from what it is not. The Internet econ- 
omy rewards unlimited creativity in the 
monetization of human action, and 
fairly often this takes the form of some 
kind of intermediation. For DNS, mon- 
etized intermediation means lying. The 


innovators who bring us such mon- | 


etized intermediation do not call what 
they sell lies, but in this case it walks 
like a duck and quacks like one, too. 
Notall misuses of DNS take the form 
of lying. Another frequently seen abuse 
is to treat DNS as a directory system, 
which it is not. In a directory system 
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one can ask approximate questions and 
get approximate answers. Think of a 
printed telephone white pages directo- 
ry here: users often find what they want 
in the printed directory not by knowing 
exactly what the listing is but by start- 
ing with a guess or a general idea. DNS 
has nothing like that: all questions and 
all answers are exact. But DNS has at 
least two mechanisms that can be mis- 
used to support approximate matching 
at some considerable cost to everybody 
else, and a lot of that goes on. 


Stupid DNS Tricks 
The first widespread form of a DNS 
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lie was to treat DNS lookups as map- 
ping requests. Content distribution | 
networks (CDNs), such as Akamai, and 
Web optimizer products, such as Cisco 
Distributed Director, treat incoming | 
DNS lookups as opportunities to direct 
the activities of Web browsers. Using 
the IP source address of a DNS request, 
these products and services try to guess 
the proximity of the requester to each of 
many replicated content servers. Based 
on the measured load of each content 
server’s system and network, and onan 
estimate of each content server’s prox- 
imity to that requester, a DNS response 
is crafted to direct that requester to the 
closest or best content server for that 
URI domain. 

Problems abound from this ap- 
proach, but none affects the CDN op- 
erator’s revenue. First and foremost it 
is necessary to defeat or severely limit 
caching and reuse of this policy-based 
data (“DNS lies”). Caching and reuse, 
which once were considered essential 
to the performance and scalability of 
DNS, would allow a policy-based re- 
sponse intended for requester A also | 
to be seen by requester B, which might | 
not otherwise receive the same an- 
swer—for example, when server loads | 
have changed and there’s a new bal- 
ance. The effects of this noncaching 
are a higher DNS request rate (perhaps 
leading to higher revenue for CDNs that 
charge by the transaction) and more 
network load for access-side networks 
and a slightly higher floor for average 
transaction time. 

Furthermore, it has never been 
wise to assume that a DNS request’s 
IP source address gives any hint of an 
end-system Web browser’s network lo- 
cation. This is because DNS requests 
heard by a CDN come from recursive 
DNS servers as a result of cache misses; 
they do not come from end systems 
themselves. Some ISPs _ regionalize 
their recursive name servers, allowing 
CDNs to encode rules improving the | 
quality of their estimates. Many recur- 
sive name servers are per-country or 
per-continent or even per-hemisphere, 
however, so it’s always necessary for a 
CDN to deploy well-connected super- 
nodes, and these always end up hear- | 
ing a lot of out-of-region requests. 

The primary benefit of a CDN is ae 
same as gimmick-free outsourcing: i 
gives a content owner somebody to sue | 
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if things don’t go well. That DNS system 
performance and stability must pay the 
price for such liability shielding is at 
best unfortunate. Given a CDN still re- 
quires supernodes that will hear many 
out-of-region requests, a gimmick-free 
approach here would be to answer DNS 
truthfully and let existing pseudoran- 
dom distribution mechanisms do their 
work. Noting that there is no patent 
on the existing pseudorandomization 
technologies and that nobody ever got 
fired for buying a CDN, we can expect 
to see more content distributed this 
way in the decades to come. 


NXDOMAIN Remapping 
Fairly often, as in millions of times per 


| second worldwide, somebody looks up 


a domain name in DNS that isn’t there. 
Maybe this is a user at a Web browser 
making a typographical error, or may- 
be there’s a broken link on a Web site, 
or maybe a hardware or software error 
is causing nonexistent names to go 
into DNS requests. One way or another, 
the answer is generally supposed to 
be NXDOMAIN (sometimes written as 
RCODE=3). These negative answers are 
cacheable, as is any other kind of DNS 


information, since DNS is designed to | 


express truth, not policy. A network ap- 
plication (perhaps a Web browser, or 


mail server, or indeed anything at all | 
that uses TCP/IP flows to do its busi- | 


ness) that gets back one of these nega- 
tive responses is supposed to treat it as 
an error and reject its own underlying 
work item that led to this lookup. Fora 
Web browser, rejection takes the form 
of an “error page.” For a mail server, 
rejection takes the place of “bounced 
email.” Every TCP/IP application, large 


orsmall, newor old, knows howtocope | 


with NXDOMAIN. 


The Web has changed the rules. | 
| Though the Web is young—and though 
| the Internet was here before the Web 


and will be here after the Web and is 
much larger than the Web—the fact 
remains that the Web is what end us- 
ers are looking at. Advertisers have a 
whole language to describe the value 
of end users, with words such as “im- 
pressions,” “click-throughs,” and 
“eyeballs.” Why on earth, these adver- 
tisers ask, would you ever send back 
an NXDOMAIN if an impression was 
possible? So it is, increasingly, that in 
place of the NXDOMAIN your applica- 
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Author Paul Vixie, noted by Wired magazine 
as “the godfather of DNS,” has been solving 
DNS errors and mysteries for over 20 years. 


tion knows how to handle, if you ask for 
aname that does not exist, you’ll get a 
positive (deceptive; false; lying) answer 
that your application also knows how 
to handle. 

For example, if I ask my own recur- 
sive name server for a name that does 
not exist, it will tell me NXDOMAIN. If 
Iask OpenDNS’s recursive name server 
for a name that does not exist, it will 
send me a NOERROR response with an 
answer pointing at an advertising serv- 
er. Note that I’m using OpenDNS as a 
convenient example; it did not invent 
this technique. Indeed, Nominum and 
other DNS vendors now sell an add-on 
to their recursive name service prod- 
ucts to allow any ISP in the world to do 
this, and a growing number of ISPs are 
doing it. Why so many? The answer is 


simply whoever remaps these NXDO- 
MAIN responses gets the impression 
revenue. There are unverified claims 
that some ISPs are blocking access to 
OpenDNS and/or all non-ISP name 


servers in order to force their custom- | 


ers to use the ISP’s own name server. 


I say unverified, but I find the claims | 


credible—ISPs have wafer-thin 
gins and if they see this kind of manna 
going out the door, they can’t just let it 
happen. 

To demonstrate the extreme desire 
to capture this revenue, a true story: A 
few years ago VeriSign, which operates 
the .COM domain under contract to 
ICANN (Internet Corporation for As- 
signed Names and Numbers), added a 
wild card to the top of the .COM zone 
(*.COM) so that its authoritative name 
servers would no longer generate NX- 
DOMAIN responses. Instead 
generated responses containing the 
address of SiteFinder’s Web site—an 


mMmar- 


they | 


advertising server. The outcry from the 
community (including your humble 
narrator) was loud and long, and before 


ICANN had a chance to file a lawsuit to | 


stop this nonsense, many people had 
patched their recursive name servers to 
remap any response froma .COM name 
server that was not a delegation (for ex- 
ample, telling how to find the Google. 
com name servers) back into an NXDO- 
MAIN. Some ISPs put logic into their 
policy-based routers to turn SiteFinder 
responses into pointers to the ISP’s 
own advertising server instead. 


Damage Control 

NXDOMAIN wasn’t designed to be a 
revenue hook—many applications de- 
pend on accurate error signals from 
DNS. For example, consider the “same 
origin trust model” used for Web 
cookies. If you’re holding a cookie for 
Google.com and you can be fooled into 
following a link to KJHSDFKJHSKJH- 
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MJHER.GOOGLE.COM, and the result- 
ing NXDOMAIN response is remapped 
into a positive answer to some advertis- 
ing server, then you’re going to send 
your Google.com cookie to that adver- 
tising server when you send your HTTP 
GET request there. Not such a bad 
thing for a Google.com cookie, but a 
real problem for a BANKOFAMERICA. 
COM cookie. (Thanks to Dan Kaminsky 
for telling me about the “same origin 
trust model” problem.) 

Remapping could also cause email 
to be captured if a mail exchanger 
(MX) request is captured in this way. 
Many NXDOMAIN remappers try to 
avoid this by triggering only on A 
(address) requests, but to make this 
work they have to turn off caching, 
since NXDOMAINS are not type spe- 
cific and since an SMTP initiator will 
fall back to type=A if it gets no answer 
from type=Mx. Similar protections (de- 
signed to keep lawsuits away while still 
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attracting revenue) include the idea of 
triggering the remapping logic only if | 
the query domain begins with WWW.— 
but as far as I know there are a lot of ty- 
pographical errors beginning WW., or 
ending with .CM, so Ido not hold outa 
lot of hope for it long term. Too much 
money is involved, and nobody wants 
to leave it on the table (where, in this 
case, it belongs). 


that as with any other kind of informa- 


the economy as opt-in, it’s valuable to 
the debate that this IETF draft’s au- 
thors have said that Web cookies aren’t 
good enough. Others may disagree but 
at least this point is now on the table. 
This document also talks about “legally 
mandated” DNS redirection, which is 
exactly the nightmare it sounds like it is 
and that we can all hope becomes a his- 
torical curiosity as rapidly as possible. 
The absolutely best part of this IETF 
draft is Section 10—DNSSEC Consid- 
erations—that ends as follows: “So the 
only case where DNS security exten- 
sions cause problems for DNS Redirect 


Standard Bad Practices 

There is at the time of this writing an 
IETF draft (think: proto-RFC) on the 
recommended configuration and use 
of DNS redirect by service providers | 
(http://tools.ietf.org/html/draft-livin- 
good-dns-redirect-00). The goal of this 
document is to present some rules for 
how DNS lies should be delivered in 
order to give all the vendors and opera- 
tors in this growing market a common 
frame of service. Some Luddites may 
feel that the “standard best practice” 
in this area is simply not to do it at all, 
but this being unrealistic, we now face 
standards action. As a standard feature 
of DNS technology we can expect a day 
to come when all DNS services are de- | 
livered this way and our kids think of 
end-to-end DNS the way they think of 
eight-track tapes. 

This document makes a substantial 
contribution to the debate around this 
feature area by suggesting that opt-out 
for this service should be a network 
layer attribute (in other words, asso- 
ciated with one’s Ethernet [MAC] ad- 
dress or equipment port number) and 
not a transport layer attribute. Noting 


case doesn’t have widespread deploy- 
using trust anchor, configured by the 


be used to sign the redirected answers. 
As noted above in Section 9.7, such im- 
proper redirection of valid responses 
may also cause DNSSEC trust verifica- 
tion problems. 


A Rescue Being Thought Of 
Fifteen years ago a bunch of ivory tower 


model has evolved over time, and now 
this set of protocol enhancements 
(DNSSEC) is more or less ready for de- 
ployment and more or less allows for 
the possibility that DNS liars will be 


SEC has taken too long and is a com- 
mittee-based horror in its inelegance 
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tion, spam opt-out is not as good for | 


ment now and could be mitigated by | 


applicable ISP or DNS ASP, that could | 
| public key for the Google.com zone is 
| published in the COM zone and so on. 


is with a validating stub resolver. This | 


theoreticians got together at IETF and | 
said, “Let’s secure DNS.” The threat | 


caught and ignored. Noting that DNS- | 


and complexity, here’s how it’s sup- 
posed to work and how it may help cur- 
tail the current market in DNS lies. 

In DNS, data producers are the au- 
thoritative name servers, each of which 
is the delegated authority for one or 
more zones. For today, think of a DNS 
zone as everything at or below a certain 
name, so, for example, www.google. 
com is in the Google.com zone. DNS- 
SEC allows these zones to be signed 
and verified using public-key cryptog- 
raphy. The private (signing) key is used 
by the editor of the zone to generate 
signature records for each set of real re- 
cords. The public key is used by recur- 
sive name servers to verify that the data 
they receive was signed by the holder of 
the corresponding private key. Public 
(verification) keys are published using 
DNS itself, by including each zone’s 
key in the zone’s parent zone—so the 


I’m deliberately skipping a long and 
unpleasant story about where the pub- 
lic key for COM is supposed to be pub- 
lished, since it’s not germane to this 
article. 

In theory, an end-system owner who 
does not like being lied to can work co- 
operatively with zone editors who don’t 
like their zones getting lied about if 
each of them deploys DNSSEC. An ap- 
plication that is supposed to receive an 
NXDOMAIN but that today receives a 
pointer to an advertising server would 
in a DNSSEC world receive a “signa- 
ture not present” error. This would be 
an error because DNSSEC has a way 
to inform a validator that a signature 
should have been present. Note that in 
the vast majority of cases zone editors 
don’t care whether their zones are be- 
ing lied about, and, therefore, DNSSEC 


will remain silent most of the time. 


/ Consider also that this was not the 


| original DNSSEC threat model; we re- 


ally thought we had to stop on-the-wire 
corruption such as that discovered by 


| Dan Kaminsky in 2008, and the idea of 


stopping in-the-middlebox corruption 
such as NXDOMAIN remapping really 
is just gravy. 

DNSSEC will complicate life for CDN 
providers using Stupid DNS Tricks, but 
it won’t end that war since it’s still pos- 
sible to sign every policy-based answer 


~ and keep all the answers and signa- 


tures available, and still send different 


answers to the same question based on 
requester identity and policy, and have 
the signatures all be perfectly valid. 

DNSSEC will also complicate life for 
sys admins and application develop- 
ers. We (ISC—the BIND people) are do- 
ing what we can to improve on that in 
BIND 9.7, and there are plenty of other 
service and technology providers in the 
space as well. The killer app for DNS- 
SEC will be a Web browser and Web 
server that can authenticate to each 
other without using X.509 (volunteers 
are hereby encouraged to get together 
and try to make that happen). 


Directory Services 

Browser implementers including Mi- 
crosoft and Mozilla have begun do- 
ing DNS queries while collecting URIs 
from their graphical front end in order 
to do fancy “autocompletion.” This 
means that during the typing time ofa 
URI such as http://www.cnn.com/, the 
browser will have asked questions such 
as W, WW, WWW, WWW.C, WWW.CN, 
WWW.CNN, and so on. It’s not quite 
that bad, since the browsers have a 
precompiled idea of what the top-level 
domains are. They won’t actually ask 
for WWW.C, for example, but they are 
now asking for WWW.CN, which is in 
China, and WWW.CNN.CO, which is in 
Colombia. 

Although one simple-sounding so- 
lution is for Microsoft and Firefox to 
buy some name-server hardware and 
network links for China and Colombia 
(and no doubt many other affected top- 
level domain operators), that won’t stop 
the information leak or remove this stu- 
pid and useless traffic from the rest of 
the network. Since the truly best solu- 
tion is, as usual, stop doing this stupid 
thing—and we all know that isn’t going 
to happen—perhaps this behavior can 
be made optional, and then we can just 
argue about what the default (opt-in 
vs. opt-out) should be. This is the first 
time in the history of DNS that some- 
one has used it prospectively, to find 
out if what has been typed is or isn’t a 
valid domain name, in order to support 
something like autocompletion. As in 
so many other novel uses of DNS, this 
is not what it was designed for. 

Had DNS been designed with this in 
mind, one of the ways we would be able 
to tell is that domain names would be 
written from highest- to lowest-order 


DNSSEC will 
complicate life for 
CDN providers using 
Stupid DNS Tricks, 
but it won’t end 
that war since it’s 
still possible to sign 
every policy-based 
answer and keep 
all the answers 

and signatures 
available, and still 
send different 
answers to the 
same question 
based on requester 
identity and policy, 
and have the 


Signatures all be 


perfectly valid. 
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term (COM.CNN.WWW). This would 
allow partial name completion just 
as happens in graphical file system 


_ browsers. Absent a complete redesign, 


which won’t happen in our lifetime 
because of the size and usefulness of 


| the installed base, all we can do is ask 


browser implementers to be smarter 
and prepare for more DNS traffic on 


| our networks. 


Conclusion 

What DNS is not is a mapping service 
or a mechanism for delivering policy- 
based information. DNS was designed 
to express facts, not policies. Because 
it works so well and is ubiquitous, 
however, it’s all too common for en- 
trepreneurs to see it as a greenfield 
opportunity. Those of us who work to 
implement, enhance, and deploy DNS 
and to keep the global system of name 
servers operating will continue to find 
ways to keep the thing alive even with 
all these innovators taking their little 
bites out of it. 

These are unhappy observations 
and there is no solution within reach 
because of the extraordinary size of the 
installed base. The tasks where DNS 


| falls short, but that people neverthe- 


less want it to be able to do, are in most 
cases fundamental to the current de- 
sign. What will play out now will be an 
information war in which innovators 
who muscle in early enough and gain 


| enough market share will prevent oth- 
| ers from doing likewise—DNS lies vs. 


DNS security is only one example. 
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this reason reducing 
ic 


e deliverin g 


es e same instruction set 
r different performance 


and have different power characteris- 
tics. Using the same ISA on all cores 


| means running the same binary on all 


cores without the need to compile code 
with a different compiler for each core 
type. This stands in contrast to hetero- 
geneous-ISA systems, such as IBM’s 
Cell or Intel’s Larrabee, where the cores 
expose different ISAs, so the code must 
be compiled separately for each core 
type. Heterogeneous-ISA systems are 
not the focus of this article. 

A typical AMP consists of several fast 
and powerful cores (high clock frequen- 
cy, complex out-of-order pipeline, and 
high power consumption) and a large 
number of slower low-power cores (low 
clock frequency, simple pipeline, and 
low power consumption). Complex and 
powerful cores are good for running 
single-threaded sequential applications 
because these applications cannot ac- 
celerate their performance by spread- 
ing the computation across multiple 
simple cores. Abundant simple cores, 
on the other hand, are good for running 


_ highly scalable parallel applications. 


Because of performance/power 
trade-offs between complex and simple 
cores, it turns out to be much more ef- 
ficient to run a parallel application on 
a large number of simple cores than 
on a smaller number of complex cores 
that consume the same power or fit 
into the same area. In a similar vein, 
complex and powerful cores are good 
for running CPU-intensive applications 
that effectively use those processors’ 
advanced microarchitectural features, 
such as out-of-order super-scalar pipe- 
lines, advanced branch prediction fa- 
cilities, and replicated functional units. 
At the same time, simple and slow 


| cores deliver a better trade-off between 
| energy consumption and performance 
for memory-intensive applications that 


spend a majority of their execution time 
fetching data from off-chip memory 
and stalling the processor. 

A symmetric multicore processor 
(SMP) includes the cores of only one 


_ type: either the complex and power- 
| ful ones, as in the Intel Xeon or AMD 
| Opteron processors, or the simple and 


lower-power ones, as in Sun’s Niagara. 
So in terms of providing the optimal 
performance/power, an SMP is ideally 
suited for some applications but not 
all. Having cores of different types in 


a single processor enables optimizing | 
performance per watt for a wider range | 
of workloads. Having cores of different | 


types on an AMP enables us to employ 
specialization (that is, we can use each 
type of core for the type of computation 
where it delivers the best performance/ 
energy trade-off). Specialization enables 
maximizing the overall efficiency of the 
system and as a result delivers better 
performance per watt and per area. 


erature investigating the properties of | 


these systems has originated from ma- 
jor hardware players such as Intel'’ and 
HP,°° indicating that within the indus- 
try there is interest in this architecture. 
Furthermore, existing SMP systems can 
be configured to be asymmetric if one 
wants to reduce the amount of power 
consumed by the processor. For exam- 


ple, configuring some of the cores ofan | 
SMP to run at a lower-than-maximum | 


voltage and frequency (via the dynamic 
voltage and frequency scaling facili- 
ties available on most modern CPUs) 
makes the CPU consume less power 
and makes the system asymmetric. 


Although single ISA AMP systems | In that case it is crucial to understand 
are not yet being built, much of the lit- | how to get the maximum performance 
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on this asymmetric system in order to 
minimize performance losses associ- 
ated with running some cores at a lower 
Hertz. 

Given the emergence of AMPs, how 
can you best exploit them to maximize 
power efficiency (performance per 
watt)? Our goal is to shed light on some 
of the challenges software developers 
will likely face while trying to achieve 
this, and to provide some practical ad- 
vice on how to maximize exploitation of 
AMPsasaresult. To that end, we provide 
several examples demonstrating how to 
employ specialization on AMP systems 
in order to maximize performance. One 
area that will be of particular impor- 
tance is the design of scheduling algo- 
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rithms for operating systems that wish 
to take full advantage of AMPs. We also 
discuss our experience in investigating 
the design of such algorithms. We dis- 
cuss a few surprising findings that we 
have discovered about the implemen- 
tation of these scheduling strategies, 
which we think will be important to 
those who are developing or adapting 
an operating system for this upcoming 
class of processor and platform. 


Specialization on AMPs 

Efficiency of AMP systems could be 
improved using two kinds of special- 
ization: the first caters to diversity in 
thread-level parallelism; the second 
caters to microarchitectural diversity of 
the workload. 

Catering to diversity in thread-level 
parallelism. Diversity in thread-level 
parallelism refers to the two broad cat- 
egories into which applications can be 
classified: scalable parallel applica- 
tions and sequential applications. Scal- 
able parallel applications use multiple 
threads of execution, and increasing 
the number of threads typically leads 
to reduced execution time or increased 
amount of work performed in a unit 
of time. Sequential applications, on 
the other hand, typically use only one 
or a small number of threads and it is 
difficult to structure the computation 
such that it runs efficiently in a multi- 


threaded environment. In addition to | 


purely parallel or purely sequential ap- 


Figure 1. Relative speedup experienced by applications from the SPEC CPU2000 
benchmark suite from running on a fast core (2.3GHz) vs. a slow core (1.15GHz) of an 


plications, there is a hybrid type, where 
an application might have phases of 
highly parallel execution intermixed 
with sequential phases. 

These two types of applications | 
require different types of processing 
cores to achieve the best trade-off in 
performance and energy consumption. 
Suppose we have a scalable parallel ap- 
plication with a choice of running it on 
a processor either with a few complex 
and powerful cores or with many simple 
low-power cores. For example, suppose 
we have a processor with four complex 
and powerful cores and another area- 
equivalent and power-budget-equiva- 
lent processor consisting of 16 simple/ _ 
low-power cores. Suppose further that | 
each simple core delivers roughly half | 
the performance of one complex core. 
(The numbers to estimate the conver- 
sion ratios of performance and power in 
complex vs. simple cores were obtained 
from Hill and Marty.’) We configure the 
number of threads in the application 
to equal the number of cores, which is 
a standard practice for compute-inten- 
sive applications. If we run this paral- 
lel application on the processor with 
complex cores, then each thread will 
run roughly twice as fast as the thread 
running on the processor with simple 
cores (assuming that threads are CPU- 
intensive and that synchronization and | 
other overhead is negligible), but we 
can use only four threads on the com- 
plex-core processor vs. 16 threads on 


emulated AMP system. The maximum achievable speedup is a factor of 2. The more 


memory-intensive the application is the less speedup it experiences. More details on the 
experimental setup can be found in Shelepov." 


the simple-core processor. Since using 
additional threads results in a propor- 
tional performance improvement in 


| this application, we get twice as much 


performance running on a simple-core 
processor as on a complex-core proces- 
sor. Recalling that these two processors 
use the same power budget, we achieve 
twice as much performance per watt. 

Contrast this to running a sequen- 
tial application, which cannot increase 
its performance by using additional 
threads. Therefore, using a_ single 
thread, it will run twice as slow on a 
simple-core processor than on a com- 
plex-core processor, meaning we get 
twice as much performance per watt 
running on the complex-core system. 
An experienced reader will observe 
that power consumption on a simple- 
core system for this single-application 
workload could be reduced by turning 
off unused cores. Unfortunately, it is 
not always possible to turn off unused 
cores completely, especially if they are 
located in the same power domain as 
the active cores. Furthermore, an op- 
erating-system power manager may be 
configured to avoid putting the unused 
cores in a deep sleep state, because 
bringing the cores up from this state 
takes time. Thus, if a new application 
begins running or if the operating sys- 
tem needs a core for execution, then ad- 
ditional latency will be incurred while 
the dormant core is being brought up 
in the active power state. 

This example demonstrates that ap- 
plications with different levels of paral- 
lelism require different types of cores to 


_ achieve the optimal performance-per- 


watt ratio. AMP systems offer the poten- 
tial to resolve this dilemma by providing 
the cores of both types. Another advan- 


| tage of having both “fast” and “slow” 


cores in the system is that the fast ones 
can be used to accelerate sequential 
phases of parallel applications, mitigat- 
ing the effect of sequential bottlenecks 
and reducing effective serialization. Hill 
and Marty demonstrated that for paral- 
lel applications with sequential phases 


_ AMPs can potentially offer performance 
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significantly better than SMPs, as long 
as sequential phases constitute as little 
as 5% of the code." 

Catering to microarchitectural diver- 
sity of the workload. The relative benefit 
that an application derives from run- 
ning on a fast core rather than a slow 


one depends on the microarchitectural 
properties of the program.***'' Some 
programs are very efficient at using the 
CPU pipeline: they have a high amount 
of instruction-level parallelism, mean- 
ing that a processor can issue many in- 
structions in parallel without running 
out of work. These programs show good 
locality of memory accesses. As a result 
they rarely access the main memory and 
thus rarely stall the processor. We refer 
to these programs as CPU-intensive. 

At the other extreme are programs 
that use the CPU pipeline very ineffi- 
ciently. They typically have a high pro- 
cessor cache-miss rate and thus stall 
the CPU pipeline, because they have to 
wait while their data is being fetched 
from main memory. We refer to these 
programs as memory-intensive . (Note 
that this is not the same as an I/O- 
bound application, which often relin- 
quishes the CPU when it must perform 
device 1/O. A memory-intensive applica- 
tion might run on the CPU 100% of its 
allotted time, but it would use the CPU 
inefficiently.) 

CPU-intensive programs use the 
hardware of fast cores very efficiently; 
thus, they derive relatively large ben- 


efits from running on fast cores relative | 


to slow cores. Memory-intensive appli- 
cations, on the other hand, derive rela- 
tively little benefit from running on fast 
cores. Figure 1 shows some example 
speedup ratios of applications in the 
SPEC CPU2000 suite on an emulated 
AMP system. An SMP was used to emu- 
late this AMP system using dynamic 
frequency scaling. The fast core was 
emulated by running a core at 2.3GHz; 
the slow core was emulated by using the 
frequency of 1.15GHz. Note that some 
applications experience a 2x speedup, 
which is proportional to the differ- 
ence in the CPU frequency between 
the two processors. These are the CPU- 
intensive applications that have a high 
utilization of the processor’s pipeline 
functional units. Other applications 
experience only a fraction of the achiev- 
able speedup. These are the memory-in- 
tensive applications that often stall the 
CPU as they wait for data to arrive from 
the main memory, so increasing the 
frequency of the CPU does not directly 
translate into better performance for 
them. For example, a memory-intensive 
application equake speeds up by only 
25% when running on the fast core. 


Having cores of 
different types in 

a single processor 
enables optimizing 
performance per 
watt for a wider 
range of workloads. 
Having cores of 
different types 

on an AMP enables 
us to employ 
specialization. 
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For systemwide efficiency, it is 
more profitable to run CPU-intensive 
programs on fast cores and memory- 
intensive programs on slow cores. This 
is what catering to microarchitectural 
diversity of the workload is all about. 
Recent work from the University of Cali- 
fornia, San Diego and HP demonstrated 
that AMP systems can offer up to 63% 
better performance than can an SMP 
that is comparable in area and power, 


| provided that the operating system em- 


ploys a scheduling policy that caters to 
the microarchitectural diversity of the 
workload.° 


Asymmetry-Aware Scheduling 
Employing specialization is the key to 
realizing the potential of AMP systems. 


| Specialization on AMP systems will not 


be delivered by the hardware; it is up 
to the software to employ asymmetry- 
aware scheduling policies that tailor 
asymmetric cores to the instruction 
streams that use them most efficiently. 
Athread scheduler must be aware of the 


| asymmetric properties of the system 


and assign threads to cores in consid- 
eration of the characteristics of both. 
In this section we report on our experi- 
ence in designing and implementing 


| such asymmetry-aware scheduling al- 


gorithms in a real operating system. We 
first describe an algorithm that caters 
to diversity in thread-level parallelism 
and then an algorithm that caters to di- 
versity in the workload’s microarchitec- 
tural properties. 

A Scheduler Catering to Diversity in 
Thread-Level Parallelism. The idea be- 


| hind our parallelism-aware (PA) sched- 


uleris simple: itassigns threads running 
sequential applications or sequential 
phases of parallel applications to run on 
fast cores and threads running highly 


_ scalable parallel code to run on slow 


cores. The following example demon- 
strates that the PA scheduling policy can 


| achieve a much better system efficiency 


VOL, 52 


than an asymmetry-unaware policy. We 
emphasize that the goal in using the PA 
policy is to maximize systemwide effi- 
ciency, not to improve performance of 
particular applications. As a result this 
policy will be inherently unfair: some 
threads will have a higher priority than 
others in running on fast cores. Impli- 
cations of a policy that equally shares 
fast cores among all threads are demon- 
strated in our earlier study.’ 
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Consider an asymmetric processor 
with one fast core and nine slow cores, 
where the fast core delivers approxi- 
mately twice as much single-threaded 
performance as the slow core. Let’s say 


Sequential phase runs 


on a SLOW core 
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Figure 2. An illustration of how a PA scheduler would accelerate a parallel application 
limited by a sequential bottleneck on an AMP processor. 


we run a workload of one sequential 
application and one parallel applica- 
tion, and the parallel application has 
nine threads. Under a naive scheduling 
policy that equally shares fast and slow 


Sequential phase runs 
ona FAST core 
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Completion time 
reduced by 25% 


Figure 3. Speedup achieved with a PA algorithm over the asymmetry-agnostic default 


scheduler on an emulated AMP system. 
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Figure 4. Speedup achieved with a PA algorithm over the asymmetry-agnostic default 
scheduler using the busy-wait and adaptive synchronization modes. 
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cores among threads, each thread will 
run on the fast core 10% of the time and 
on the slow core 90% of the time. (Note 
that existing operating-system schedul- 
ers would not share complex and simple 
cores equally. Distribution of time on 
different core types would be arbitrary. 
We assume a policy that shares cores 
equally to simplify the example.) To sim- 
plify comparison of different schedul- 
ing policies, we use as our performance 
measure the overall workload speedup 
relative to running all threads on slow 
cores the entire time. Under this naive 
policy, each thread will speed up by 1.1 
relative to running on a slow core (to 
work this out, consider that each thread 
runs at a speed of 2x for 10% of the time 
and at a speed of 1x for 90% of the time), 
and the workload-wide speedup will also 
be 1.1x. Note that when computing the 
speedup for a parallel application we as- 
sume that the speedup for the entire ap- 
plication is close to the average speedup 
of its threads rather than the aggregate 
speedup—this assumption is reasonable 
if threads of a parallel computation are 
working on a common task as opposed 
to performing unrelated tasks requiring 
no inter-thread communication. 

Under a PA policy, the single-thread- 
ed application will run on the fast core 
for the entire time, and the threads of 
the parallel application will run on slow 
cores. As a result, the single-threaded 
application will speed up by a factor of 
2x,and the parallel application will have 
no speedup (1x). The average speedup 
for the two applications will be 1.5x, or 
40% better than under the naive policy. 

As another example, consider a par- 
allel application with 50% of its code 
executing sequentially. An asymmetry- 
unaware scheduler may fail to assign 
the bottleneck sequential phase to run 
ona fast core, but a PA scheduler would 
make sure to accelerate it. Suppose the 
fast core runs twice as fast as the slow 
core: the PA scheduler would deliver up 
to 25% performance improvement to 
that application (see Figure 2). 

Figure 3 shows the performance of a 
number of parallel applications on an 
emulated AMP system with our imple- 
mentation of a PA scheduler in OpenSo- 
laris relative to the default asymmetry- 
agnostic scheduler in that operating 
system. To emulate AMP we use a real 
multicore system (AMD Opteron with 
16 cores), and we emulate fast cores by 


using a high clock frequency (2.3GHz) 
and slow cores by using a low frequency 
(1.15GHz). The implementation of the 
algorithm and the experimental plat- 
form are described in more detail in 
a previous work.'° In this experiment 
we use four fast and 12 slow cores. 
The applications used in Figure 3 are 
drawn from several benchmark suites, 
such as SPEC OpenMP 2001, PARSEC, 
MineBench, and NAS. RNA is a bioin- 
formatics application performing RNA 
sequence matching. 

The figure shows a variety of speedup 
values. Applications with significant se- 
quential phases (40%-60%) experience 
a performance improvement of up to 
26% relative to an asymmetry-unaware 
scheduler. Applications with small se- 
quential phases (wupwise, for example) 
experience no speedup from the PA pol- 
icy, because they do not stand to benefit 
from acceleration of sequential phases 
on fast cores. 

Here, we discuss two main challeng- 
es involved in delivering the benefits 


of the PA scheduling policy to applica- | 


tions: effectively detecting sequential 
and parallel phases and avoiding per- 
formance overhead that may result 
from cross-core migrations. 

Detecting sequential and parallel 
phases. The simplest way to detect par- 
allel and sequential phases in an ap- 
plication is to use the runnable thread 


count as a heuristic. If an application | 
uses a large number of threads, thenits | 


runnable thread count will be high, and 
this application would be in a parallel 
phase. Conversely, an application with 


a single runnable thread would be ina | 
sequential phase. The great property | 


of the runnable thread count is that in 
modern multithreading environments 
it is visible to the operating system, be- 


cause these systems map application- | 


level threads to kernel-level threads. 
Therefore, by monitoring the runnable 
thread count an operating system can 


distinguish between parallel and se- | 


quential phases in applications. 


Unfortunately, in some situations | 


using the runnable thread count for 
detection of sequential phases might 
not work. In particular, an application 


could be running nonscalable code | 


while still using a large number of run- 
nable threads. We describe two sce- 
narios where this might happen and 
discuss potential remedies. 


Inonescenario, anapplication might 
be susceptible to an external scalability 
bottleneck—for example, as a result of 
memory bandwidth contention. In this 
case the system memory bus is saturat- 
ed, and using additional threads does 
not speed up the application because 
those threads do not contribute to use- 
ful computation. A sensible way to solve 
this problem is to reduce the number of 
threads used in an application to the 
point where the application operates 


at its peak efficiency. Essentially, this | 


boils down to configuring the number 
of threads properly in a parallel appli- 
cation. Suleman et al. describe a tech- 
nique called feedback-driven thread- 
ing, which allows you to dynamically 
determine the optimal thread count for 
parallel applications." 

In another scenario, an applica- 
tion might be limited by internal scal- 


ability bottlenecks: for example, there | 


might be a load imbalance where some 
threads do more work than others or 
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necks where one thread executes the 
code in a critical section while other 
threads wait. When threads wait they 
may either block, relinquishing the 
CPU, or busy-wait, spinning on the CPU 
in a tight loop. If threads block, then 
the runnable thread count is reduced 
and any such reduction is visible to the 
operating system; but if threads busy- 
wait, sequential phases might be hid- 
den from the operating system. 
Whether the application uses block- 
ing or busy-waiting depends on the 
implementation of the synchronization 
primitives, which are used to construct 
critical sections or barriers. Busy-waiting 
makes sense on a multiprocessor when 
the wait times are expected to be short. 
Blocking a thread is an expensive opera- 
tion that should be avoided during short 
waiting periods. If the wait time is long, 
however, blocking is usually preferred 
so as to avoid wasting CPU resources. 
One of the most popular strategies used 
in synchronization libraries is the adap- 


Figure 5. Two configurations of an AMP system. Large squares represent memory domains 


with one or more cores and a last-level cache (LLC) inside the domain. “Fast” cores are 
denoted by large red boxes, “slow” cores are denoted by small blue boxes. 
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Figure 6. Performance overhead relative to the default scheduler on a migration-unfriendly 


topology (like in Figure Sa) and on a migration-friendly topology (like in Figure Sb, but with 
three slow cores in each memory domain). Lower numbers denote higher overhead. 


@ Migration-unfriendly topology 


5% 


-15% 


Speedup over Default Scheduler 


@ Migration-friendly topology 


-25% 

Se | eee LE | en | | 1 ees | =| | 

a rs ee ee a ae 

> a S -% S 3 a Ww < 

a @ “ a = o = = 

3 3 . ” 

3 3 

DECEMBER 2009 | VOL.52. NO.12 COMMUNICATIONS OF THE ACM 53 


practice 


for a while and then blocks. On an AMP 
system, using the right synchronization 
waiting mode is crucially important. 
If busy-waiting is used, a PA scheduler 
would not be able to detect and acceler- 
ate the bottleneck serial phases. 

Figure 4 shows the performance of 
selected OpenMP applications that 
could be configured to use either the 
adaptive or the busy-wait mode in the 
synchronization library. We show the 
performance under the PA scheduler 
relative to the default OpenSolaris 
scheduler. We use the same emulated 
AMP setup as described for Figure 3, 
but in this case we configure the system 
with one fast core and 12 slow cores. 
For each application, we show the per- 
formance under the busy-wait and the 
adaptive modes. 

As the figure shows, the PA scheduler 
can deliver significant performance im- 
provements (up to 40%) for applications 
with large sequential phases but only if 
sequential phases are exposed to the 


scheduler via the adaptive synchroniza- | 


tion mode. When the busy-wait mode is 
used, the scheduler is unable to detect 


Figure 7. Process involved in modeling relative speedup. 
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sequential phases and accelerate them 
on the fast core of the AMP. 
An alternative to using the adaptive 


mode would be to implement a new | 


synchronization mode where the syn- 
chronization primitive explicitly noti- 
fies the scheduler when a thread begins 
to spin.’ Using this waiting mode will 
further ensure that spinning threads 
do not waste the resources of fast cores. 
When making changes to the synchro- 
nization library is not an option, how- 


ever, using adaptive synchronization | 


mode will do the job. 

Avoiding the overhead. A more diffi- 
cult challenge in implementing PA or 
other asymmetry-aware algorithms is 
to avoid the overhead associated with 
migrating threads across the cores. 
Any asymmetry-aware algorithm relies 
on cross-core migrations to deliver the 
benefits of its policy. For example, the 


PA algorithm must migrate a thread | 
from a slow core to a fast core if it de- | 


tects that the thread is executing a se- 
quential phase. 


Migrations are an essential tool of | 
asymmetry-aware algorithms, but un- | 
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Figure 8. Performance degradation over the best static scheduling assignment from 
two approaches used to determine relative speedup of threads on cores of different types. 
The approach based on architectural signatures outperforms the direct measurement 


approach due to a lower runtime overhead. 
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| fortunately they can be quite expensive. 
The AMP shown in Figure 5a consists of 
several memory domains, as is usually 
the case with modern multicore proces- 
sors. A memory domain is defined to 
contain cores that share an LLC (last- 
_ level cache). LLC is the last “line of de- 


| fense” on the frontier between the CPU 


and the main memory. Thus, if the re- 
quired data is not in the LLC, then the 
processor has to fetch it from the main 
memory, which takes hundreds of CPU 
cycles and slows down the computa- 
tion considerably. In contrast, fetching 
_ data from an LLC takes only a few tens 
of processor cycles. Therefore, we want 
to minimize the number of accesses to 
the main memory and try to satisfy data 
requests from an LLC or other CPU 
caches as frequently as possible. 

In Figure 5a, the fast core is located 
in a different memory domain from 
the slow cores, so every time the sched- 
uler migrates a thread to the fast core, 
the thread loses the data accumulated 
in the LLC of the slow core’s memory 
domain and must fetch the data from 
the main memory. (Depending on the 
implementation of the processor, the 
thread might have to fetch the data 
from its old LLC, not from memory, 
but that is still more expensive than 
fetching it from the LLC in its current 
memory domain.) As we will show 
later, this can cause significant perfor- 
mance overhead. 
| Consider now Figure 5b, which de- 
| picts a different AMP system where 
each fast core is located in the same 
memory domain as several slow cores. 
This architecture makes it easier for the 
scheduler to avoid cross-memory-do- 
main migrations. In this case, a sched- 
uler will try to migrate a thread to a fast 
core that is within the same memory do- 
main as the slow core where the thread 
was previously running, thus enabling 
the thread to reuse the data in the LLC. 

Figure 6 shows performance over- 
head experienced by several applica- 
tions on two emulated AMP systems: 
one is a migration-unfriendly system 
configured like the system in Figure 5a; 
and another is the migration-friendly 
_ system configured like the system in 
| Figure 5b, except that each domain has 
only three slow cores because of the 
limitations of the experimental hard- 
| ware. Our PA scheduler is designed to 
be topology-aware, meaning that it at- 


tempts to avoid cross-memory-domain 
migrations whenever possible. We mea- 
sured the overhead by making the des- 
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proposed in the research community 
to address this challenge. The first ap- 


| proach entails running each thread on 


ignated fast cores on these two systems | _ cores of different types, registering the 


run at the same frequency as the slow bs speedup obtained on a fast core rela- 
é For systemwide cee 


cores—so no performance gains were 
to be expected from asymmetry-aware 


efficiency, it is 


tive to a slow core and using the result- 
ing relative speedup as the measure for 


scheduling, but the overhead was still | = classifying the applications. In a sched- 
present, since our scheduler still mi- more profitable uling algorithm, a thread with a larger 
ae gsi across lin gaan to run CPU-bound hate speedup ae be ts ae 
at the system is asymmetric. ence to run on a fast core, and a threa 
Comparing the performance of appli- programs on with a lower relative speedup would be 
cations under the PA scheduler and the | more likely to run on a slow core. Since 
ast cores an y 


default scheduler, we can find out the mi- this approach relies on direct measure- 


gration-related performance overhead. memory-intensive ment of relative speedup, we refer to it 


In this case, performance degradation | as the direct measurement approach. 

under the PA scheduler is equivalent to programs on A second approach, referred to as 

migration overhead. As Figure 6 shows, slow cores. _ the modeling approach, is to model 

performance overhead can be quite Thi = the speedup on a fast vs. slow core us- 
is is what 


significant on a migration-unfriendly 
system, but it becomes negligible on a catering to | time properties obtained either offline 
migration-friendly system coupled with or online. Modeling is less accurate 


a topology-aware scheduler. microarchitectural than direct measurement but does not 


In summary, a parallelism-aware diversity of require running each thread on each 
scheduling policy can deliver real per- type of core, thus avoiding potential 


formance improvements on asymmet- the workload load imbalance and expensive cross- 
ric hardware for parallel applications | § core migration (we elaborate on these 
P PP is all about. g 


ing a summary of an application’s run- 


limited by sequential phases. The key 


is to configure the synchronization li- | 


brary to “reveal” the sequential phases 
to the scheduler. To avoid cross-mem- 
ory-domain migration overhead, AMP 


systems should be designed such that | 
fast cores share a memory domain with | 


some of the slow cores and combined 
with a topology-aware scheduler that 
minimizes cross-domain migrations. 


A Scheduler Catering to Microarchi- 


tectural Diversity. Remember that the 
idea of catering to microarchitectural 


diversity of the workload is to assign | 
CPU-intensive threads (or phases of exe- | 


cution) to fast cores and memory-inten- 
sive threads (or phases) to slow cores. 
Recall from Figure 1 that CPU-intensive 
code will experience a higher relative 
speedup running on fast vs. slow cores 
_ than memory-intensive code, so sched- 
uling it on fast cores is more efficient 
in a cost-benefit analysis. Just like the 
PA policy, this policy will be inherently 
unfair: it may improve performance 
of some applications at the expense of 


others, but it will improve the efficiency | 


of the system as a whole. 

The biggest challenge in imple- 
menting such an algorithm is to clas- 
sify threads or phases of execution as 
CPU-intensive or memory-intensive at 
scheduling time. Two approaches were 


issues later). In an effort to build an 
asymmetry-aware algorithm that caters 
to microarchitectural diversity, we have 
experimented with both methods. 

The direct measurement approach 
manifested several performance prob- 
lems. Consider a scenario where each 
thread must be run on each core type 
to determine its relative speedup. Given 
that a running thread may switch phas- 


| es of execution (that is, it may be doing 


different types of processing at different 
points in time), this measurement must 


| be repeated periodically; otherwise, the 
| scheduler might be operating on stale 


data. Since the number of threads will 
typically be larger than the number of 
fast cores, there will always be a high 


demand for running on fast cores for 


the purpose of remeasuring relative 
speedup. As a result, threads that are 


| “legitimately” assigned to run on fast 


cores by the scheduling policy will ob- 


- serve undue interference from threads 


trying to measure their speedup there. 
Furthermore, having too many threads 


| “wanting” to run on scarce fast cores 


may cause load imbalance, with fast 
cores being busy and slow cores being 


| idle. When we used this direct measure- 


ment approach in an asymmetry-aware 
algorithm, we found that these prob- 
lems made it difficult to deliver signifi- 
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cant performance improvementrelative 
to an asymmetry-agnostic scheduler.*" 
The modeling approach involved 
predicting relative speedup on differ- 
ent core types using certain properties 
of the running programs. Since we were 
keen on evaluating this approach on 
real hardware (simulators put limita- 
tions on the length and number of ex- 
periments that can be performed), we 
could experiment only with the asym- 
metry that was caused by the differenc- 
es in the clock frequency of different 
cores. As a result, our relative speedup 
model was tuned to work for this spe- 
cific type of asymmetric hardware. 
(This is the only type of AMP configura- 
tion available on today’s commercial 
hardware.) At the same time, we do not 
see any fundamental reasons why our 
model could not be adapted to work on 
other single-ISA asymmetric systems. 
Recall that the main factor deter- 
mining how much speedup a program 
would obtain from running on a fast 
core is how memory-intensive the pro- 
gram is. A good way to capture memory- 
intensity is via a memory reuse profile, 
a compact histogram showing how well 
a program reuses its data.’ If a program 
frequently reuses the memory loca- 
tions it has touched in the past, then 
the memory reuse profile will capture 
the high locality of reference. If a pro- 
gram hardly ever touches the memory 
values used in the past (as would a vid- 


eo-streaming application, for example), | 


the memory reuse profile will capture 
that as well. Memory reuse profiles are 
so powerful that they can be used to 
predict with high accuracy the cache- 
miss rate of a program in a cache of any 
size and associativity. This is precisely 
the feature that we relied on in evaluat- 
ing memory-intensity of programs and 
building our estimation model. 
Without going into much detail, in 
our scheduling system we associate a 


memory reuse profile with each thread. | 


We refer to this profile as the architec- 
tural signature, since it captures how 
the program uses the architectural fea- 
tures of the hardware. The idea is that 
an architectural signature may contain 
a broad range of properties needed to 
model performance on asymmetric 
hardware, but for our target AMP sys- 
tem, using just a memory reuse pro- 
file was sufficient. Using that profile, 
the scheduler predicts each program’s 


The best static 
assignment 
always results 
in running the 
CPU-bound 


applications on the 
fast cores and the 
memory-intensive 


applications 


on the slow cores. 
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miss rate in the LLC, and using that 
miss rate, it estimates the approxi- 
mate fraction of CPU cycles that this 
program will spend waiting on main 
memory. The scheduler can then trivi- 
ally estimate the speedup that each pro- 
gram will experience running on a fast 
core relative to a slow core (see Figure 
7). Then the scheduler simply assigns 
threads with higher estimated speed- 
ups to run on fast cores and threads 
with lower estimated speedups to run 
on slow cores, making sure to preserve 
the load balance and fairly distribute 
CPU cycles. The resulting scheduler 
is called HASS (heterogeneity-aware 


| signature-supported), and more details 


about its implementation are available 
in our earlier publication." 

To evaluate how well the approach 
based on architectural signatures 
helps the scheduler determine the op- 
timal assignment of threads to cores, 
we compare the resulting performance 
with that under the best static assign- 
ment. A static assignment is one where 
the mapping of threads to cores is de- 
termined at the beginning of execu- 


| tion ofa particular workload and never 


changed thereafter. The best static 
assignment is not known in advance, 
but can be obtained experimentally by 
trying all static assignments and pick- 
ing the one with the best performance. 
The best static assignment is the the- 
oretical optimum for our signature- 
supported algorithm, since it relies on 
static information to perform the as- 
signment (the architectural signature) 
and does not change an assignment 
once it is determined. 

Figure 8 shows the performance ob- 
tained using our signature-supported 
algorithm relative to the best static 


| assignment. We show the overall per- 


formance for seven workloads. Each 
workload is constructed of four SPEC 
CPU2000 applications, two of which 
are memory-intensive and two of which 


| are CPU-intensive. Each workload is 


executed on an emulated AMP system 
with two fast cores and two slow cores, 
so one single-threaded application is 


| running on each core. The fast cores 


run at 2.3GHz, and the slow cores run 


| at 1.15GHz. We used the AMD Opteron 


(Barcelona) system for this experiment. 

The best static assignment always 
results in running the CPU-intensive 
applications on the fast cores and 


the memory-intensive applications 
on the slow cores. As Figure 8 shows, 
the signature-supported algorithm is 
always able to match the best static 
assignment within a couple of per- 
centage points. In that figure we also 
report the performance of the direct 
measurement approach mentioned 
earlier. It struggles to do as well as the 
signature-supported algorithm. 

Although the signature-supported 
scheduler performs rather well, using 
itin areal scheduler involves an impor- 
tant challenge: obtaining the memory 
reuse profile needed for construction 
of the architectural signature. We are 
aware of two methods of obtaining the 
memory reuse profile: offline (used in 
our experiments) and online. 

Using an offline method we can ob- 
tain a profile by running the applica- 
tion through a profiler that is capable 
of monitoring and summarizing its 
memory-access patterns (for x86 ex- 
ecutables we used the Pin binary in- 
strumentation tool;® this approach is 
described in more detail in our earlier 
work"'). The offline-generated profile 
must be somehow attached to the pro- 
gram binary (for example, by embed- 
ding it into the binary itself). Further- 
more, it requires cooperation on the 
part of the developer, who must be re- 
sponsible for generating the memory 
reuse profile for typical executions of 
the program. 

The benefit of using offline-gen- 
erated profiles is that they require no 
runtime overhead in the scheduling 
algorithm associated with their gen- 
eration. The drawback of offline pro- 
files is that it may be difficult to gener- 
ate them accurately for multithreaded 
programs, and it may be tricky to ac- 
count for different program phases. 
If a program changes its architectural 
properties according to the phase 
that it executes, one needs to associ- 
ate multiple profiles with a single pro- 
gram. Furthermore, if the behavior of 
the program changes depending on 
the input, profiles obtained offline 
may turn out to be inaccurate. 


The other method involves gener- 


ating a memory reuse profile online. 
Although we have not experimented 
with the online approach ourselves, a 
group from the University of Toronto 


has recently proposed an efficient on- | 


line method for generating accurate 


memory reuse profiles.'* This online 
approach would not have the same 
performance overhead as the direct 
measurement approach, because it 
would not require running each thread 
on each core type; a profile is collected 
while the thread is running on a single 
core and thus would not create the as- 
sociated load imbalance. 

Finally, there may be another prom- 
ising approach for online estimation 
of relative speedup. Recall that our 
architectural signature method de- 
termines relative speedup based on 
the estimated LLC miss rates. Instead 
of estimating these miss rates from 
memory reuse profiles, we could also 


measure the miss rates online using | 


hardware performance counters. Our 
preliminary results indicate this is a 
fruitful method for determining rela- 
tive speedup online. We believe this 
method will be an effective way of es- 
timating the relative speedup dynami- 
cally while avoiding the drawbacks of 
the direct measurement technique 
and the complexity associated with 
generation of memory reuse profiles. 


Conclusion 
Asymmetric multicore systems prom- 


ise to deliver higher performance per | 


watt than conventional symmetric pro- 
cessors. To realize these benefits, how- 
ever, the workload must have sufficient 
diversity (in terms of either parallelism 
or microarchitectural properties), and 
the operating-system scheduler must 
be designed to leverage this diversity 


on asymmetric hardware. Our experi- | 


ence with designing asymmetry-aware 
schedulers demonstrates that these al- 
gorithms can be feasibly implemented 
on real systems and used to unleash 
the potential of AMP systems without 


practice 
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Easing the programmer's burden does not 
| compromise system performance or increase 
| the complexity of hardware implementation. 


BY JOSEP TORRELLAS, LUIS CEZE, JAMES TUCK, 
CALIN CASCAVAL, PABLO MONTESINOS, WONSUN AHN, 
| AND MILOS PRVULOVIC 


The Bulk Multicore 
Architecture 

for Improved 
Programmabllity 


MULTICORE CHIPS AS Commodity architecture 
for platforms ranging from handhelds to 
supercomputers herald an era when parallel 
programming and computing will be the norm. 
While the computer science and engineering 
community has periodically focused on advancing 
the technology for parallel processing,* this time 
around the stakes are truly high, since there is 
no obvious route to higher performance other 
than through parallelism. However, for parallel 
computing to become widespread, breakthroughs 
are needed in all layers of the computing stack, 
including languages, programming models, 
compilation and runtime software, programming 
and debugging tools, and hardware architectures. 
At the hardware-architecture layer, we need to 
change the way multicore architectures are designed. 
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In the past, architectures were de- 
signed primarily for performance or 
for energy efficiency. Looking ahead, 
one of the top priorities must be for 
the architecture to enable a program- 
mable environment. In practice, pro- 
grammability is a notoriously difficult 
metric to define and measure. At the 


| hardware-architecture level, program- 
| mability implies two things: First, the 


architecture is able to attain high ef- 
ficiency while relieving the program- 
mer from having to manage low-level 
tasks; second, the architecture helps 


| minimize the chance of (parallel) pro- 


gramming errors. 

In this article, we describe a 
novel, general-purpose multicore 
architecture—the Bulk Multicore— 


| we designed to enable a highly pro- 
| grammable environment. In it, the 


programmer and runtime system 
are relieved of having to manage the 
sharing of data thanks to novel sup- 


port for scalable hardware cache co- 


herence. Moreover, to help minimize 
the chance of parallel-programming 
errors, the Bulk Multicore provides 
to the software high-performance se- 
quential memory consistency and also 
introduces several novel hardware 
primitives. These primitives can be 
used to build a sophisticated program- 
development-and-debugging environ- 
ment, including low-overhead data- 
race detection, deterministic replay 
of parallel programs, and high-speed 


| disambiguation of sets of addresses. 


The primitives have an overhead low 
enough to always be “on” during pro- 
duction runs. 

The key idea in the Bulk Multi- 
core is twofold: First, the hardware 


_ automatically executes all software 


as a series of atomic blocks of thou- 
sands of dynamic instructions called 
Chunks. Chunk execution is invisible 
to the software and, therefore, puts no 
restriction on the programming lan- 
guage or model. Second, the Bulk Mul- 
ticore introduces the use of Hardware 
Address Signatures as a low-overhead 
mechanism to ensure atomic and iso- 
lated execution of chunks and help 


maintain hardware cache coherence. 
The programmability advantages of 
the Bulk Multicore do not come at the 
expense of performance. On the con- 
trary, the Bulk Multicore enables high 
performance because the processor 
hardware is free to aggressively reor- 
der and overlap the memory accesses 
of a program within chunks without 
risk of breaking their expected behav- 


ior in a multiprocessor environment. 
Moreover, in an advanced Bulk Mul- 
ticore design where the compiler ob- 
serves the chunks, the compiler can 
further improve performance by heav- 
ily optimizing the instructions within 
each chunk. Finally, the Bulk Multi- 


core organization decreases hardware 


design complexity by freeing proces- 
sor designers from having to worry | 
about many corner cases that appear | 
when designing multiprocess | 
| 
Architecture 

The Bulk Multicore architecture elim- | 
inates one of the traditional tenets of 
processor architecture, namely the 
need to commit instructions in order, 
providing the architectural state of the 
processor after every single instruc- 
tion. Having to provide such state in 
a multiprocessor environment—even 
if no other processor or unit in the 
machine needs it—contributes to the 
complexity of current system desig 
This is because, in such an environ- 
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VOL 


ment, memory-system accesses take 
many cycles, and multiple loads and 
stores from both the same and dif- 
ferent processors overlap their execu- 
tion. 

In the Bulk Multicore, the default 
execution mode of a processor is to 
commit chunks of instructions at a 
time.’ A chunk is a group of dynami- 
cally contiguous instructions (such as 
2,000 instructions). Such a “chunked” 
mode of execution and commit is a 
hardware-only mechanism, invisible 
to the software running on the pro- 

sor. Moreover, its purpose is not to 
parallelize a thread, since the chunks 
ina thread are not distributed to other 
processors. Rather, the purpose is to 


52 NO. 12 COMMUNICATIONS OF THE ACM 59 


contributed articles 


improve programmability and perfor- 
mance. 

Each chunk executes on the pro- 
cessor atomically and in isolation. 
Atomic execution means that none of 
the chunk’s actions are made visible 
to the rest of the system (processors or 
main memory) until the chunk com- 
pletes and commits. Execution in iso- 
lation means that if the chunk reads a 


location and (before it commits) a sec- | 


ond chunk in another processor that 
has written to the location commits, 


then the local chunk is squashed and 
must re-execute. 

To execute chunks atomically and 
in isolation inexpensively, the Bulk 
Multicore introduces hardware ad- 
dress signatures.’ A signature is a 
register of ~1,024 bits that accumu- 
lates hash-encoded addresses. Figure 
1 outlines a simple way to generate a 
signature (see the sidebar “Signatures 
and Signature Operations in Hard- 
ware” for a deeper discussion). A sig- 
' nature, therefore, represents a set of 


Signatures and Signature 
Operations in Hardware 


Figure 1 in the main text shows a simple implementation of a signature. The bits of an 
incoming address go through a fixed permutation to reduce collisions and are then 
separated in bit-fields C;. Each field is decoded and accumulated into a bit-field V; in the 
signature. Much more sophisticated implementations are also possible. 

A module called the Bulk Disambiguation Module contains several signature 
registers and simple functional units that operate efficiently on signatures. These 
functional units are invisible to the instruction-set architecture. Note that, given a 
signature, we can recover only a superset of the addresses originally encoded into the 
signature. Consequently, the operations on signatures produce conservative results. 

The figure here outlines five signature functional units: intersection, union, test 
for null signature, test for address membership, and decoding (8). Intersection finds 
the addresses common to two signatures by performing a bit-wise AND of the two 
signatures. The resulting signature is empty if, as shown in the figure, any of its bit- 
fields contains all zeros. Union finds all addresses present in at least one signature 
through a bit-wise OR of the two signatures. Testing whether an address a is present 
(conservatively) in a signature involves encoding a into a signature, intersecting the 
latter with the original signature and then testing the result for a null signature. 

Decoding (5) a signature determines which cache sets can contain addresses 
belonging to the signature. The set bitmask produced by this operation is then passed 
to a finite-state machine that successively reads individual lines from the sets in the 
bitmask and checks for membership to the signature. This process is used to identify 
and invalidate all the addresses in a signature that are present in the cache. 

Overall, the support described here enables low-overhead operations on sets of 


addresses.’ 


Operations on signatures. 
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addresses. 

In the Bulk Multicore, the hard- 
ware automatically accumulates the 
addresses read and written by a chunk 
into a read (R) and a write (W) signa- 
ture, respectively. These signatures 
are kept in a module in the cache hi- 
erarchy. This module also includes 
simple functional units that operate 
on signatures, performing such op- 
erations as signature intersection (to 
find the addresses common to two 


| signatures) and address membership 


test (to find out whether an address 
belongs to a signature), as detailed in 
the sidebar. 

Atomic chunk execution is sup- 
ported by buffering the state gener- 
ated by the chunk in the L1 cache. 
No update is propagated outside the 
cache while the chunk is executing. 
When the chunk completes or when a 
dirty cache line with address in the W 


| signature must be displaced from the 


cache, the hardware proceeds to com- 
mit the chunk. A successful commit 
involves sending the chunk’s W sig- 
nature to the subset of sharer proces- 
sors indicated by the directory? and 
clearing the local R and W signatures. 
The latter operation erases any record 
of the updates made by the chunk, 
though the written lines remain dirty 
in the cache. 

The W signature carries enough 
information to both invalidate stale 
lines from the other coherent caches 
(using the 6 signature operation on W, 
as discussed in the sidebar) and en- 
force that all other processors execute 
their chunks in isolation. Specifically, 


to enforce that a processor executes a 


chunk in isolation when the processor 
receives an incoming signature Win, 
its hardware intersects W;,. against 
the local R,,. and W,,. signatures. If any 
of the two intersections is not null, it 
means (conservatively) that the local 


| chunk has accessed a data element 


written by the committing chunk. 
Consequently, the local chunk is 
squashed and then restarted. 

Figure 2 outlines atomic and iso- 
lated execution. Thread 0 executes 
a chunk that writes variables B and 
C, and no invalidations are sent out. 
Signature W, receives the hashed ad- 
dresses of B and C. At the same time, 
Thread 7 issues reads for B and C, 
which (by construction) load the non- 


speculative values of the variables— 
namely, the values before Thread 0's 
updates. When Thread 0’s chunk com- 
mits, the hardware sends signature W, 
to Thread 7, and W, and Ry are cleared. 
At the processor where Thread 7 runs, 
the hardware intersects W, with the 
ongoing chunk’s R, and W,. Since Wy 
OR, is not null, the chunk in Thread 
1 is squashed. 

The commit of chunks is serial- 


ized globally. In a bus-based machine, | 


serialization is given by the order in 
which W signatures are placed on the 
bus. With a general interconnect, seri- 


alization is enforced by a (potentially | 


distributed) arbiter module.’ W sig- 
natures are sent to the arbiter, which 


quickly acknowledges whether the | 


chunk can be considered committed. 
Since chunks execute atomically 
and in isolation, commit in program 


order in each processor, and there is | 
a global commit order of chunks, the | 


Bulk Multicore supports sequential 
consistency (SC)° at the chunk level. 
As a consequence, the machine also 
supports SC at the instruction level. 


More important, it supports high- | 


performance SC at low hardware com- 
plexity. 

The performance of this SC imple- 
mentation is high because (within 
a chunk) the Bulk Multicore allows 
memory access reordering and over- 
lap and instruction optimization. As 
we discuss later, synchronization in- 
structions induce no reordering con- 
straint within a chunk. 

Meanwhile, hardware-implementa- 
tion complexity is low because memo- 
ry-consistency enforcement is largely 
decoupled from processor structures. 
In a conventional processor that is- 
sues memory accesses out of order, 
supporting SC requires intrusive pro- 


cessor modifications. For example, | 


from the time the processor executes 
a load to line L out of order until the 
load reaches its commit time, the 
hardware must check for writes to L 
by other processors—in case an in- 
consistent state was observed. Such 
checking typically requires sending, 
for each external coherence event, a 
signal up the cache hierarchy. The sig- 
nal snoops the load queue to check for 
an address match. Additional modifi- 
cations involve preventing cache dis- 
placements that could risk missing a 
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Figure 1. A simple way to generate a signature. 
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Figure 2. Executing chunks atomically and in isolation with signatures. 


Thread 1 


id B 
st T] W,=sig(T) 
id C]} R,=sig(B,c) 


(Waa R,) v (Won W,) 


| or runtime system. However, it does 


queues, L1 caches, and other critical | 


processor components must be aug- 
mented with extra hardware. 

In the Bulk Multicore, SC enforce- 
ment and violation detection are per- 
formed with simple signature inter- 
sections outside the processor core. 
Additionally, caches are oblivious to 
what data is speculative, and their tag 
and data arrays are unmodified. 

Finally, note that the Bulk Mul- 
ticore’s execution mode is not like 
transactional memory.° While one 
could intuitively view the Bulk Multi- 
core as an environment with transac- 
tions occurring all the time, the key 
difference is that chunks are dynamic 
entities, rather than static, and invis- 
ible to the software. 


High Programmability 

Since chunked execution is invisible 
to the software, it places no restriction 
on programming model, language, 


DECEMBER 2009 


enable a highly programmable envi- 
ronment by virtue of providing two 
features: high-performance SC at the 
hardware level and several novel hard- 
ware primitives that can be used to 
build a sophisticated program-devel- 
opment-and-debugging environment. 

Unlike current architectures, the 
Bulk Multicore supports high-per- 
formance SC at the hardware level. 
If we generate code for the Bulk Mul- 


| ticore using an SC compiler (such as 


the BulkCompiler'), we attain a high- 
performance, fully SC platform. The 
resulting platform is highly program- 
mable for several reasons. The first is 
that debugging concurrent programs 
with data races would be much easier. 
This is because the possible outcomes 
of the memory accesses involved in 
the bug would be easier to reason 


| about, and the debugger would in 


VOL,.52 


fact be able to reproduce the buggy 
interleaving. Second, most existing 
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software correctness tools (such as 
Microsoft’s CHESS") assume SC. Veri- 
fying software correctness under SC is 
already difficult, and the state space 
balloons if non-SC interleavings need 
to be verified as well. In the next few 
years, we expect that correctness-veri- 
fication tools will play a larger role as 
more parallel software is developed. 
Using them in combination with an 
SC platform would make them most 
effective. 

A final reason for the program- 
mability of an SC platform is that it 
would make the memory model of 
safe languages (such as Java) easier 
to understand and verify. The need to 
provide safety guarantees and enable 
performance at the same time has re- 
sulted in an increasingly complex and 
unintuitive memory model over the 


years. A high-performance SC memo- | 


ry model would trivially ensure Java’s 
safety properties related to memory 
ordering, improving its security and 
usability. 

The Bulk Multicore’s second fea- 
ture is a set of hardware primitives 
that can be used to engineer a sophis- 
ticated program-development-and- 
debugging environment that is always 
“on,” even during production runs. 
The key insight is that chunks and 


signatures free development and de- | 


bugging tools from having to record 
or be concerned with individual loads 
and stores. As a result, the amount of 
bookkeeping and state required by 
the tools is substantially reduced, as 
is the time overhead. Here, we give 
three examples of this benefit in the 
areas of deterministic replay of paral- 
lel programs, data-race detection, and 
high-speed disambiguation of sets of 
addresses. 

Note, too, that chunks provide an 
excellent primitive for supporting 
popular atomic-section-based tech- 
niques for programmability (such as 
thread-level speculation'’’ and trans- 
actional memory’). 


Deterministic replay of parallel pro- | 


grams with practically no log. Hard- 
ware-assisted deterministic replay 
of parallel programs is a promising 
technique for debugging parallel 
programs. It involves a two-step pro- 
cess.*” In the recording step, while 
the parallel program executes, spe- 
cial hardware records into a log the 
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The Bulk Multicore 

supports 

high-performance 
sequential memory 

consistency at 

low hardware 
complexity. 
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order of data dependences observed 
among the multiple threads. The log 
effectively captures the “interleaving” 
of the program’s threads. Then, in the 
replay step, while the parallel program 
is re-executed, the system enforces 
the interleaving orders encoded in the 
| log. 

In most proposals of determinis- 
tic replay schemes, the log stores in- 
dividual data dependences between 
threads or groups of dependences 
bundled together. In the Bulk Multi- 
core, the log must store only the total 
_ order of chunk commits, an approach 
| we call DeLorean." The logged infor- 
mation can be as minimalist as a list 
of committing-processor IDs, assum- 
ing the chunking is performed in a 
deterministic manner; therefore, the 
chunk sizes can be deterministically 
reproduced on replay. This design, 
which we call OrderOnly, reduces the 
log size by nearly an order of magni- 
tude over previous proposals. 

The Bulk Multicore can further re- 
duce the log size if, during the record- 
ing step, the arbiter enforces a certain 
order of chunk commit interleaving 
among the different threads (such as 
by committing one chunk from each 
| processor round robin). In this case 
of enforced chunk-commit order, the 
log practically disappears. During the 
| replay step, the arbiter reinforces the 
original commit algorithm, forcing 
the same order of chunk commits as 
in the recording step. This design, 
which we call PicoLog, typically incurs 
a performance cost because it can 
force some processors to wait during 
recording. 

Figure 3a outlines a parallel execu- 
tion in which the boxes are chunks 
and the arrows are the observed cross- 
thread data dependences. Figure 3b 
shows a possible resulting execution 
log in OrderOnly, while Figure 3c 
shows the log in PicoLog. 

Data-race detection at production- 
run speed. The Bulk Multicore can 
support an efficient data-race detec- 
tor based on the “happens-before” 
method” if it cuts the chunks at syn- 
| chronization points, rather than at 
arbitrary dynamic points. Synchroni- 
zation points are easily recognized by 
| hardware or software, since synchro- 
_ nization operations are executed by 
' special instructions. This approach 


is described in ReEnact"; Figure 4 in- 
cludes examples with a lock, flag, and 
barrier. 

Each chunk is given a counter 
value called ChunkID following the 
happens-before ordering. Specifi- 
cally, chunks in a given thread receive 


ChunkIDs that increase in program | 


order. Moreover, a _ synchroniza- 


tion between two threads orders the | 


ChunkIDs of the chunks involved in 
the synchronization. For example, in 
Figure 4a, the chunk in Thread 2 fol- 


lowing the lock acquire (Chunk 5) | 


sets its ChunkID to be a successor of 


both the previous chunk in Thread 2 | 


(Chunk 4) and the chunk in Thread 7 
that released the lock (Chunk 2). For 
the other synchronization primitives, 
the algorithm is similar. For exam- 
ple, for the barrier in Figure 4c, each 
chunk immediately following the bar- 
rier is given a ChunkID that makes ita 
successor of all the chunks leading to 
the barrier. 

Using ChunkIDs, we’ve given a 
partial ordering to the chunks. For 
example, in Figure 4a, Chunks 7 and 
6 are ordered, but Chunks 3 and 4 are 
not. Such ordering helps detect data 
races that occur in a particular execu- 
tion. Specifically, when two chunks 


from different threads are found to | 
have a data-dependence at runtime, | 
their two ChunkIDs are compared. If | 


the ChunkIDs are ordered, this is not 
a data race because there is an inter- 
vening synchronization between the 
chunks. Otherwise, a data race has 
been found. 

A simple way to determine when 
two chunks have a data-dependence 
is to use the Bulk Multicore signa- 
tures to tell when the data footprints 


of two chunks overlap. This opera- | 


tion, together with the comparison 
and maintenance of ChunkIDs, can 
be done with low overhead with hard- 
ware support. Consequently, the Bulk 
Multicore can detect data races with- 
out significantly slowing the program, 
making it ideal for debugging produc- 
tion runs. 

Enhancing programmability by mak- 
ing signatures visible to software. Final- 
ly,atechnique that improves program- 
mability further is to make additional 
signatures visible to the software. This 
support enables inexpensive monitor- 
ing of memory accesses, as well as 


Making Si 
Visible to 


contributed articles 


atures 
oftware 


We propose that the software interact with some additional signatures through three 


main primitives:"* 


The first is to explicitly encode into a signature either one address (Figure 1a) or all 
addresses accessed in a code region (Figure 1b). The latter is enabled by the bcollect 
(begin collect) and ecollect (end collect) instructions, which can be set to collect only 


reads, only writes, or both. 


The second primitive is to disambiguate the addresses accessed by the processor 
in a code region against a given signature. It is enabled by the bdisamb.loc (begin 
disambiguate local) and edisamb.loc (end disambiguate local) instructions (Figure 1c), 


and can disambiguate reads, writes, or both. 


The third primitive is to disambiguate the addresses of incoming coherence 
messages (invalidations or downgrades) against a given local signature. It is enabled 
by the bdisamb.rem (begin disambiguate remote) and edisamb.rem (end disambiguate 
remote) instructions (Figure 1d) and can disambiguate reads, writes, or both. When 
disambiguation finds a match, the system can deliver an interrupt or set a bit. 

Figure 2 includes three examples of what can be done with these primitives: Figure 
2a shows how the machine inexpensively supports many watchpoints. The processor 
encodes into signature Sig2 the address of variable y and all the addresses accessed in 
function foo(). It then watches all these addresses by executing bdisamb.loc on Sig2. 

Figure 2b shows how a second call to a function that reads and writes memory in 
its body can be skipped. In the figure, the code calls function foo() twice with the same 
input value of x. To see if the second call can be skipped, the program first collects 
all addresses accessed by foo() in Sig2. It then disambiguates all subsequent accesses 
against Sig2. When execution reaches the second call to foo(), it can skip the call if two 
conditions hold: the first is that the disambiguation did not find a conflict; the second 
(not shown in the figure) is that the read and write footprints of the first foo() call do not 
overlap. This possible overlap is checked by separately collecting the addresses read 
in foo() and those written in foo() in separate signatures and intersecting the resulting 


signatures. 


Finally, Figure 2c shows a way to detect data dependences between threads running 
on different processors. In the figure, collect encodes all addresses accessed in a 
code section into Sig2. Surrounding the collect instructions, the code places disamb. 
rem instructions to monitor if any remotely initiated coherence-action conflicts with 
addresses accessed locally. To disregard read-read conflicts, the programmer can 
collect the reads in a separate signature and perform remote disambiguation of only 


writes against that signature. 


Figure 1. Primitives enabling software to interact with additional signatures: 
collection (a and b), local disambiguation (c), and remote disambiguation (d). 
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Figure 2. Using signatures to support data watchpoints (a), skip execution of 
functions (b), and detect data dependencies between threads running on 


different processors (c). 
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novel compiler optimizations that re- 
quire dynamic disambiguation of sets 


of addresses (see the sidebar “Making | 


Signatures Visible to Software”). 


Reduced Implementation 
Complexity 


The Bulk Multicore also has advan- | 
tages in performance and in hardware | 


simplicity. It delivers high perfor- 
mance because the processor hard- 
ware can reorder and overlap all mem- 
ory accesses within a chunk—except, 
of course, those that participate in 
single-thread dependences. In partic- 
ular, in the Bulk Multicore, synchroni- 
zation instructions do not constrain 
memory access reordering or overlap. 
Indeed, fences inside a chunk are 
transformed 
Fences’ traditional functionality of 
delaying execution until certain ref- 
erences are performed is useless; by 
construction, no other processor ob- 
serves the actual order of instruction 
execution within a chunk. 


Moreover, a processor can concur- | 


rently execute multiple chunks from 
the same thread, and memory access- 
es from these chunks can also overlap. 
Each concurrently executing chunk 
in the processor has its own R and W 
signatures, and individual accesses 
update the corresponding chunk’s 
signatures. As long as chunks within 
a processor commit in program order 
(if a chunk is squashed, its succes- 
sors are also squashed), correctness is 
guaranteed. Such concurrent chunk 
execution in a processor hides the 
chunk-commit overhead. 

Bulk Multicore performance in- 
creases further if the compiler gener- 


ates the chunks, as in the BulkCom- | 


piler.' In this case, the compiler can 
aggressively optimize the code within 


each chunk, recognizing that no other | 


processor sees intermediate states 
within a chunk. 

Finally, the Bulk Multicore needs 
simpler processor hardware than cur- 
rent machines. As discussed earlier, 
much of the responsibility for mem- 
ory-consistency enforcement is taken 
away from critical structures in the 
core (such as the load queue and L1 
cache) and moved to the cache hierar- 
chy where signatures detect violations 
of SC.* For example, this property 
could enable a new environment in 
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Thread 2 


Thread 1 
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Release L SetFlag F 


Release L 
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which cores and accelerators are de- 
signed without concern for how to sat- 
isfy a particular set of access-ordering 
constraints. This ability allows hard- 
ware designers to focus on the novel 
aspects of their design, rather than 
on the interaction with the target ma- 


chine’s legacy memory-consistency | 


model. It also motivates the develop- 
ment of commodity accelerators. 


Related Work 

Numerous proposals for multipro- 
cessor architecture designs focus on 
improving programmability. In par- 
ticular, architectures for thread-level 
speculation (TLS)” and transactional 
memory (TM)° have received signifi- 
cant attention over the past 15 years. 
These techniques share key primitive 
mechanisms with the Bulk Multicore, 


notably speculative state buffering | 
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Figure 3. Parallel execution in the Bulk Multicore (a), with a possible 
OrderOnly execution log (b) and PicoLog execution log (c). 
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Figure 4. Forming chunks for data-race detection in the presence 
im of a lock (a), flag (b), and barrier (c). 


into null instructions. | 
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and undo and detection of cross- 
thread conflicts. However, they also 
have a different goal, namely simplify 
code parallelization by parallelizing 
the code transparently to the user 
software in TLS or by annotating the 
user code with constructs for mutual 
exclusion in TM. On the other hand, 
the Bulk Multicore aims to provide a 
broadly usable architectural platform 
that is easier to program for while de- 
livering advantages in performance 
and hardware simplicity. 

Two architecture proposals in- 
volve processors continuously execut- 
ing blocks of instructions atomically 
and in isolation. One of them, called 
Transactional Memory Coherence and 
Consistency (TCC),° is a TM environ- 
ment with transactions occurring all 
the time. TCC mainly differs from the 
Bulk Multicore in that its transactions 


are statically specified in the code, 
while chunks are created dynamically 
by the hardware. The second propos- 
al, called Implicit Transactions,” is 
a multiprocessor environment with 
checkpointed processors that regular- 
ly take checkpoints. The instructions 
executed between checkpoints consti- 
tute the equivalent of a chunk. No de- 
tailed implementation of the scheme 
is presented. 

Automatic Mutual — Exclusion 
(AME) is a programming model in 
which a program is written as a group 
of atomic fragments that serialize in 
some manner. As in TCC, atomic sec- 
tions in AME are statically specified 
in the code, while the Bulk Multicore 
chunks are hardware-generated dy- 
namic entities. 

The signature hardware we’ve in- 
troduced here has been adapted for 
use in TM (such as in transaction- 
footprint collection and in address 
disambiguation”*”'). 

Several proposals implement data- 
race detection, deterministic replay of 
multiprocessor programs, and other 
debugging techniques discussed here 
without operating in chunks.*!''>?° 
Comparing their operation to chunk 
operation is the subject of future work. 


Future Directions 

The Bulk Multicore architecture is a 
novel approach to building shared- 
memory multiprocessors, where the 
whole execution operates in atomic 
chunks of instructions. This approach 
can enable significant improvements 
in the productivity of parallel pro- 
grammers while imposing no restric- 
tion on the programming model or 
language used. 


At the architecture level, we are ex- | 


amining the scalability of this organi- 
zation. While chunk commit requires 
arbitration in a (potentially distrib- 
uted) arbiter, the operation in chunks 
is inherently latency tolerant. At the 
programming level, we are examin- 
ing how chunk operation enables 
efficient support for new program- 
development and debugging tools, 
aggressive autotuners and compilers, 
and even novel programming models. 
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| University CS departments are incorporating _ 


game design and development to prepare their 
students for the game industry’s expectations. 


| BY MICHAEL ZYDA 


omputer 

cience in the 

onceptual 
Age 


IN TECHNOLOGY, THE conceptual age is defined | 
oe nae or creative assets, including design, 
storytelling, artistry, empathy, play, and emotion. 
Good engineering or good computer science is no 
longer enough; de sign must be just as good. 

The transition fro1 m information age to conceptual 
age has been overlooked by most academics in 
computer science, yet many of the consequences of 
the transition have been apparent for the past decade 
at least, including a drop in undergraduate interest, 
the outsourcing of U.S. programming jobs, and the 
decline in research laboratories focused on advanced 
computer science research. Today, another aspect 

of the transition is emerging: integration of game 
development into computer science curricula. Here, 

I discuss what it looks like, how it affects computer 
science departments, and how it helps drive the 
overall transition. 
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The last quarter of the 20th cen- 
tury saw the U.S. economy transition 
from the industrial age to the infor- 
mation age, a move characterized 


by major changes in profession, as 


many people chose to be knowledge 
workers, rather than manual labor- 


ers. Computer science was the driving 


intellectual, social, and cultural force 
behind the information age. Comput- 
er researchers and developers spear- 
headed widespread adoption of new 
technology and were paid well. Aca- 
demic computer science departments 
grew in terms of numbers of students, 
faculty, and facilities; almost all ma- 
jor universities received donations, 
including new “computer science 


| buildings,” and enrollment was so 


large that standardized tests had to 
be created for the field. These were 
the fattest of times, when computer 
science research drove the growth 
in the national, as well as the global, 
economy. 

The dot-com crash closing out 
the 20th century signaled the end of 
the era. Computer science was now 
viewed as an unstable career choice, 


| with corresponding drop in interest 


by young people.* Additionally, many 
computer science departments heard 


_ complaints from industry that their 


recent graduates were unprepared to 
be part of the modern work force. So, 
following their economic self-inter- 


| est, students migrated to other fields, 


and the leading U.S. universities saw 
major growth in undergraduate busi- 
ness programs. 

We can hypothesize that the drop 


| in interest in computer science was 


part of the natural ebb and flow and 
fashion of career choice. But doing so 
is to stick one’s head in the sand and 
hope for the return of yesteryear. A 
more progressive view is to focus on 
the concurrent transition from infor- 
mation age to conceptual age. The old 


Scene from the Artemis Chronicle PC game 
built with Microsoft’s XNA toolkit and 

the USC GamePipe Laboratory NitroX 
game engine. 
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focus on “tractable abstractions sepa- 
rate from real problems”? is no longer 
acceptable. In the conceptual age, we 
need to change the direction of the 
field or continue an unwelcome slide 
toward irrelevance. 


Cognitive and Creative Assets 

The conceptual age of technology is 
defined by cognitive and creative as- 
sets, with the design side being just 
as good. We see harbingers in recent 
business success. For example, Ap- 
ple has done well with design-driven 
products enabled by great engineer- 
ing. On the other hand, Microsoft is 
a great engineering company that 
deemphasizes design at its own peril; 
the Vista operating system, despite 
its technical success, was not built 
with user experience as its ultimate 
goal and had great difficulty with re- 
spect to adoption. Much of the rest of 
the traditional computing industry is 
shrinking, but the game industry is a 
segment that continues to grow due 
to its focus on design backed up by 
great supporting engineering. The de- 
mand for computer scientists capable 
of building games is high, with large 
companies like Electronic Arts report- 
ing that 65% of their hiring demand 
is for programmers skilled in build- 
ing games. Unfortunately, the kind 


of computer scientist required by | 
the game industry is not exactly what | 


traditional computer science depart- 
ments produce. 

The game industry wants gradu- 
ates who are strong programmers and 
system developers, skilled in game de- 
sign, and capable of and experienced 
in game development in large, cross- 
disciplinary collaborative teams. 
Some computer science departments 
produce graduates who are strong 
programmers and system developers, | 
but most do not produce graduates 
skilled in game design. Moreover, | 
most do not produce graduates facile 
in large, cross-disciplinary collabora- 
tive teams. The typical computer sci- | 
ence graduate has little experience 
with team software development be- | 
yond one or two projects with three to 
five other computer science students. 

Computer science departments | 
can retool themselves to meet these 
challenges, but, for game develop- 
ment, doing so requires a strong, ex- 
perienced champion and proper re- 
sources. Here, I discuss a particular 
approach we take at the University of 
Southern California, outcomes from 
that program, and questions with re- 
spect to transitioning a mature field 
toward the conceptual age. 


Game Development 

and CS Education 

If computer science departments are 
indeed to make the transition, what 


will they transition into? The game 
industry also wants graduates with a 
strong background in computer sci- 
ence. It does not want graduates with 
watered-down computer science de- 
grees, but rather an enhanced set of 
skills. This is good news for the de- 
partments, meaning they can transi- 
tion some courses to new material 
or new foci, create new courses, and 
still not abandon decades of hard- 


| won knowledge. The best way to think 


about the transition to the conceptual 
age is to make the focus the “big idea 
or big concept,” with a follow-on fo- 
cus on how to build the concept and 
with what technologies. 

Strong programming skills is the 
first item on the list, meaning an 
undergraduate program’s first four 
computer science courses—CS-101 
Fundamentals of Computer Program- 
ming, CS-102 Data Structures, CS- 
105 Object Oriented Programming, 
and CS-201 Principles of Software 
Development—must be taught in a 
rigorous manner by excellent practic- 
ing programmers. While computer 
programming languages abound, 
and the historic computer science at- 
titude “You learn one language, and 
it’s easy to pick up the next one” is not 
shared by industry. The game industry 
will also tell you that it wants the first 
four programming classes in C++, not 
Java, according to M.M. McGill’ and 
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my own private communications with 
directors of human resources in major 
game-development companies. Many 
universities switched to Java with the 
Internet boom, as it is easy to resource 


a programming laboratory for Java, | 


and Java support for the user interface 
is extensive. But the game industry 
programs mainly in C++. Many game 
companies say they will not interview 
or hire someone whose first program- 
ming language is Java. Computer 
scientists might argue about the vir- 
tues of programming languages, but 
most console and PC games are built 
in C++. From a C++ perspective, our 
students can teach themselves other 
useful languages, learning Objective- 
C for iPhone game development or 
C# for games with XNA in a single se- 
mester. The industry is basically say- 
ing it wants strong programmers with 
multiple courses in C++, its primary 
development language. The USC ex- 
perience is that 100% of its students 
interviewed for programming posi- 
tions are given three-to-four-hour- 
long programming tests, with almost 
all companies administering the tests 
in C++. We had one company (NCSoft) 
test in C and another (Microsoft) indi- 
cate the test could be done in C++, C, 
or C#. Programming interns/job seek- 
ers from our program Spring 2009 (35 
interviewed in the game industry) 
found no companies administering 
programming tests in Java. 

Strong skills in system develop- 


Programming 
_interns/job seekers 
from our program 

| Spring 2009 (35 
interviewed in the 
game industry) 


administering 
programming tests 
in Java. 


ment is another must. The figure here 


outlines the components that must be 
juggled by a programmer developing a 
typical networked game, touching on 
much of computer science and then 
some. Moreover, the game software 
must run multithreaded in a well-bal- 
anced manner and provide an immer- 
sive and entertaining experience to 
the game player. Game development 
is viewed by some as systems design. 

Programmers comfortable in cross- 
disciplinary groups is third on the list. 
Industry wants programmers who are 
able to generate software based on 
the vision of designers, work with art- 
ists to generate the right display and 
feel, and know how to participate in 
large-team development efforts. The 
traditional computer science student 
is far from this. 


Computer science departments 
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must provide the necessary experi- 
ence so game programmers are in- 
dustry-ready on graduation day. This 
means that graduates must have built 
a significant game by collaborating 
| with other students, not all of whom 
| have backgrounds in computer sci- 
ence. Grads should also be familiar 
with the pipeline-development pro- 
cess and asset-management systems, 
_ both aspects of game-industry devel- 
opment potentially useful throughout 
the program. Such experience is very 
different from traditional computer 
science-degree programs in which 
software-development teams are 
_ small, and there is no strong require- 
| ment forasset management or source- 
code base sharing or versioning. 


Big Game Program or Baby Steps? 
Computer science departments 
changing their focus toward game 
development is an obvious approach 
toward preparing students for the 
conceptual age. Many universities 
are building game-development pro- 
| gramswithinoralignedwiththeircom- 
| puter science departments.'*°%”° 
The annual Foundations of Digital 
Games conference (http://www.foun- 
dationsofdigitaleames.org/) focuses 
on this important transition. 

How does all this directly affect the 
departments? Consider two separate 
efforts: the USC GamePipe Laborato- 
ry and the University of Washington, 
Bothell, the latter covered in the arti- 
cle “Computer Games and Computer 
Science” by Kelvin Sung (on page 74). 


USC GamePipe Laboratory 
The USC Department of Computer 
Science offers both a bachelor’s de- 
gree in computer science (games) and 
_ amaster’s degree in computer science 
_ (game development).'? Students inter- 
ested in Ph.D.-level topics are encour- 
_ aged (for the moment) to apply to the 
traditional computer science Ph.D. 
program. The bachelor’s in computer 
science (games) program includes 37 
units of traditional computer science 
and 42 units of game-development 
courses. The computer science cours- 
es are the same as those taken by 
regular students in USC’s bachelor’s 
computer-science program, except 
for the following modifications: 
Programming courses. We rewrote 
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three of the first four programming 
courses to be in C++ and created game- 
oriented examples and exercises. The 
game focus helps motivate students, 
getting them excited about program- 
ming. For Fall 2008, results showed 
a 28% increase in the number of stu- 
dents with letter grade A in the games- 
oriented CS-101 Introduction to the 
Fundamentals of Programming and 
for Spring 2009 a 49% increase (http:// 


gamepipe.usc.edu/~ zyda/GamePipe/ | 


Ghyam-Final-MS-Study-2009.pdf). Fur- 
ther analysis of the results is under- 
way to determine whether they reflect 


superior skills, superior understand- | 


ing of programming, superior motiva- 
tion, or some other cause; 

Replaced EE with CS. We replaced 
four electrical-engineering courses 
on circuit design with computer-sci- 
ence-focused EE-352 Computer Orga- 
nization & Architecture and parallel- 
programming-focused EE-452 Game 
Hardware Architecture. The removed 
courses represent an older style of 
computer-science-degree program. 
We felt it more important that stu- 
dents learn how computer architec- 
tures affect programming rather than 
how to make such architectures. We 
also felt that parallel programming 
was highly relevant to both the multi- 
threaded nature of modern game de- 
velopment and multicore processors; 

Added EE. We added EE-450 Intro- 
duction to Computer Networks as a 
degree requirement. Networking is 
offered as an elective in the regular 
computer science curriculum. Be- 
cause games need networking, all our 
students take it; 

Cut compiler courses. We eliminated 
the two compiler courses taken by reg- 
ular computer science students. ACM 
eliminated compilers as a require- 
ment from the CS core in 1979. The 
USC Computer Science Department 
uses the two courses as large program- 
ming-project capstone courses, so we 
felt we would rather have our students 
build games. The replacement for the 
games curriculum is the two-semes- 
ter CS-491A/B Final Game Projects 
course. An interesting result is that 
the Computer Science Department 
is weighing whether to allow general 
computer science students to take the 
Final Game Projects course instead of 
the compiler sequence; and 
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programming was 
highly relevant to 


| the multithreaded 


nature of modern 
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and multicore 
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No foreign languages. General edu- 
cation requirements for the bach- 


_ elor’s degree in computer science 


(games) are approximately the same 
as for USC’s regular computer sci- 
ence degree. The degree lacks a four- 
course foreign-language requirement 
as in all other USC College degrees, 


_ an accident of planning rather than a 


| game programming, 


recommendation. 

When we began planning the de- 
gree, the dean of engineering said, 
“T don’t want a weak degree.” So we 
made these changes to the computer 
science component of the program 
and are confident we offer an aca- 
demically strong and industry-viable 
undergraduate degree. 

For the games-side of the bach- 
elor’s degree, we replaced 42 units of 
electives from the general computer 
science degree with a full course in 
game development. Hence, the degree 
looks more like a double major than a 
specialization. We also have a set of 
courses on game engineering, game 
design, and game cross-disciplinary. 

Game engineering covers video- 
parallel pro- 
gramming on consoles and graphics 
processing units, and programming 
game engines, all of which are 
straightforward software-develop- 


| ment courses and all important for 


| game development. 


For game design, we send our 


| students to a three-course Game De- 


sign Workshop sequence in the USC 


| School of Cinematic Arts Interactive 


Media Program. Engineers are im- 
mersed in gameplay design as taught 
by master game designers. The first 
course in the sequence—CTIN-488 
Game Design Workshop—teaches 
students how to prototype gameplay 
using cardboard and hand-drawn art; 
they basically build board games. We 
get an interesting response from the 
engineers we send there. Some rave 
about it, saying it’s the best thing 
they’ve ever taken. Some hate it, feel- 
ing frustrated they cannot immedi- 


_ ately code-up a game. Some hate the 
_ first few weeks but in the end come 


back and say it was a great course. 


| Ultimately, the students who under- 


stand the importance of the course 
and express satisfaction end up with 
great internships/jobs in the game 
industry. It’s where we see the future 


game industry technical directors of 
the conceptual age. 

Our approach to creating a cross- 
disciplinary program was to design 
courses that could be taken by non- 
computer scientists, as well as by 
computer science majors. First-se- 
mester undergraduates survey game 
play, from the start of games to using 
(among other things) old consoles, 
old PC games, and emulators. Stu- 
dents come out of the course hooked 
on our degree program and wanting 
more. They then take a video-game 
production course to build individual 
games using GameMaker and hear 
from industry speakers on game de- 
velopment. Next is CS-281 Pipelines 
for Games & Interactives in which they 
learn how to create assets for games, 
including 3D models and anima- 
tions. One achievement is a pipeline 
asset and source-code management 
system we designed that is shared by 
all game-development courses in the 
program. We teach our students how 
to use it early on, simplifying and en- 
hancing their ability to develop games 
in subsequent courses. We also place 
all our students in a semester-long 
character-animation course. 

In their final one-and-a-half years 
before graduation, our students be- 
come ready for game development. 
All take a course developing a serious 
game ina large team for an interested 
sponsor. Their last year before gradu- 
ation is in CS-491A/B Final Game Proj- 
ects building games in large cross-dis- 
ciplinary teams from August to May, 
with game designs selected through 
a design playoff the previous May. 
The Final Game Projects course is 
administered jointly by the School of 
Cinematic Arts and has students from 
computer science (bachelor’s and 
master’s students), interactive media 
(bachelor’s of fine arts and master’s of 
fine arts), fine arts (bachelor’s of fine 
arts), animation (bachelor’s of fine 
arts), music composition, and film 
scoring. Teams in this class include 
from 11 to 25 students building sig- 
nificant games over its two-semester 
run, aiming for contest submission by 
the end of the second semester. 

Strong cross-disciplinary collabo- 
ration occurs, with results presented 
at the end of each semester on Demo 
Day when game-industry executives 


are invited to review the students’ 
work; the accompanying screenshots 
are indicative of the visual quality. 
Artemis Chronicle, a beautiful action- 
adventure title (see page 67), demon- 
strates the powerful features of the 
NitroX Engine, a revolutionary, com- 
plete development framework for 
creating XNA games. Both the game 
and the NitroX engine were built in 
the CS-491AB Final Games course 


beat-matching iPhone game (below), 
incorporates a student-designed al- 
gorithm for automatically computing 
beat detection from any song import- 
ed into the game. And Slice, an action 
role-playing game (below), puts bat- 
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over two semesters. Air Guitar God, a | 


contributed articl 


tles at your fingertips on the iPhone. 
Videos of the most recent Demo 
Day are at http://gamepipe.usc.edu/ 
USC_GamePipe_Laboratory/R%26D/ 
R%26D.html. 

At the end of each academic year, 
we now routinely place large numbers 
of students (typically around 35) in 
internships/jobs in the game indus- 
try where they are nearly instantly 
productive. In the Fall semesters in 
2007 and 2008, a team on Demo Day 
was offered a commercial deal to turn 
their game into a company for further 
development or prepare to ship com- 
mercially. Spring 2009 included five 
student-built games under commer- 
cial consideration. 


Opening screen from Air Guitar God, an iPhone beat-matching game 
developed by students in the USC GamePipe Laboratory's mobile games course. 
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by students in the USC GamePipe Laboratory. 


DECEMBER 2009 


VOL. 52 


Scene from Slice, an iPhone gesture-based fighting game developed 
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Industry wants programmers comfortable in cross-disciplinary teams. 


Effect on the USC CS Department 
How has the game-degree program 
affected USC’s Computer Science De- 
partment, nudging it along toward the 
conceptual age? First, I must say we 
have not dismantled the traditional 
bachelor’s program in computer sci- 
ence, and students continue to enroll 
in it. Computer science and games 
students share almost the same core 
computer science curriculum. 

Enrollment concerns were part of 
our original motivation with respect 
to the games program, though they 
have eased, as they have for many 
departments, according to the Com- 
puting Research Association’ annual 
Taulbee Survey (http://www.cra.org/ 
statistics/). For the Fall 2009 semes- 
ter, 29% of the students in the USC 
bachelor’s in computer science pro- 
gram are in the games program, rep- 
resenting an important influence on 
the department. 

While we have not measured it, 
probably the greatest effect we see is 
an apparent “joy of computing” feel- 
ing by our students who come to class 
highly motivated, pour their best 
ideas into their projects, and produce 
spectacularly creative results, some 
of which the game industry wants to 
commercialize. The graduating stu- 
dents who move into positions in the 
game industry return to subsequent 
Demo Days, bring their bosses, and 


tion of a creative-design component 
and making it student-driven and stu- 
dent-dependent is key to this success. 
Students take ownership of their edu- 
cational program and aspire to make 
everything they do shine. 

Another component worth men- 
tioning is the commitment of the pro- 
gram’s faculty and instructors. The 
students are in small classes where 
they have much say in direction and 
result. The faculty is available to pro- 
vide technical guidance, mentorship, 
motivation, and support. It is physi- 
cally draining if done properly, but if 
the students recognize their passion 
for games, they cannot help but be 
passionate game developers as well. 
Faculty working directly with our pro- 
gram are the executive producers of 
some 12 to 14 games per semester. 
The results of these efforts are visible 
onthe USC GamePipe Web site (http:// 
gamepipe.usc.edu). 

We have also been able to create 
a line of research funding based on 
and around games.'! It is more than 
difficult to build an R&D program on 
games without a pipeline of students 
learning to build them. Today, we have 
both. The games program has addi- 
tionally strengthened the reputation 
of USC’s Computer Science Depart- 
ment in terms of increased R&D fund- 
ing, improved hiring rates for gradu- 
ates, attention to the games program 


hire more of our graduates. The addi- | in the press, invitations to games fac- 
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ulty to speak at conferences and pub- 
lish in traditional venues, and other 
universities desiring to copy USC’s 
success. In Fall 2008, the director of 
the USC GamePipe Laboratory was ap- 
pointed an ACM Distinguished Speak- 
er (http://www.dsp.acm.org/) in recog- 
nition of the program’s achievement. 
Hiring managers and developers in 
the games industry now regard us as 
one of the “best games programs,” 
though we are only in our fourth year 
of operation. Moreover, traditional 
USC computer science faculty not cur- 
rently involved in the games program 
have begun to ask how they can partic- 
ipate, some have changed their course 
projects to be game-related, and many 
ask how we can draft proposals to- 
gether. The computer science faculty 
realizes that something important is 
happening, with some beginning to 
also guide their own programs toward 
the conceptual age. ic] 
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Modernization of all vertical services industries including finance, gov- 


ernment, media, communication, healthcare, insurance, energy and... 


more efficiently and effectively. 


IEEE 8th International Confer- 
ence on Web Services (ICWS 2010) 


As a major implementation tech- 
nology for modernizing services 
industry, Web services are Inter- 
net-based application components 
published using standard interface 
description languages and universally available 
via uniform communication protocols. In the 
eighth year, the program of ICWS 2010 will con- 
tinue to feature research papers with a wide 
range of topics focusing on various aspects of 
implementation and infrastructure of Web-based 
services. ICWS has been a prime international 
forum for both researchers and industry practi- 
tioners to exchange the latest fundamental ad- 


vances in the state of the art on Web services. 


http://www.icws.org 


_ration, services monitoring, services optimization, as well as services management. The goal of Ser- 


i\vices Computing is to enable IT services and computing technology to perform business services 


IEEE 7th international Gonfercnce on Services Gamputnig (SCC 201 
In the modern services and software industry, Services Computing has become a 
cross-discipline that covers the science and technology of bridging the gap between 
Business Services and IT Services. The scope of Services Computing covers the whole 
lifecycle of services innovation research that includes business componentization, ser- 
vices modeling, services creation, services realization, services annotation, services 


deployment, services discovery, services composition, services delivery, service-to-service collabo- 


http://conferences.computer.org/scc/2010 


IEEE 3rd International Confer- 
ence on Cloud Computing 
(CLOUD 2010) 


Cloud Computing is becoming a 
-scalable services delivery and con- 
i 


‘sumption platform in the field of 
| 


aie Computing. The technical 


foundations of Cloud Computing 
include Service-Oriented Architecture (SOA) 
and Virtualizations of hardware and software. 
The goal of Cloud Computing is to share re- 
sources among the cloud service consumers, 
cloud partners, and cloud vendors in the cloud 
value chain. Major topics cover Infrastructure 
Cloud, Software Cloud, Application Cloud, and 


Business Cloud. 


http://www.thecloudcomputing.org/2010 
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the push for presenting abstract CS 


Integrating computer games into existing cs oan : the context of familiar real- 
H i tions. 
courses may help attract students to the field, 7")... sbstra, 


Relating abstract principles to real- 


but there are guidelines to be considered. world experience has become increas- 
aes : oe ingly prominent in mathematics and 
| BY KELVIN SUNG general science education. For exam- 


_ = : ple, the Calculus Reform movement of 
the 1990s included both pedagogical 


changes and foci on real-world prob- 
lems, while the Carl Wieman Science 
Education Initiative at the University 


of British Columbia has redesigned its 
freshmen introductory physics course 


such that: 
‘As much as possible, the standard 


introductory physics material will be 


eal bial presented in connection with real-world 
situations and issues such as home 
heating, transportation, and electricity 


generation.” 
In the CS education arena, the Media 


Computation of Georgia Tech'® is an 
excellent example where foundational 


programming conceptsare presented in 
the context of popular digital multime- 
dia applications. This contextualization 
of computing education"’ is an ongoing 
effort and interactive computer video 
games, being one of the most familiar 
application areas for our students, is a 
context favored by many CS educators. 
This article presents the USC Game- 
Pipe Laboratory effort where the entire 
CS curriculum is redesigned in the con- 
text of game development (Please refer 
to the USC GamePipe Laboratory effort 
by Michael Zyda on page XXX where the 
CS curriculum is designed in the con- 
text of game development). This article 
examines the ongoing efforts to inte- 
grate computer video games in existing 
traditional CS courses. The discussion 
is divided into introductory program- 
ming courses and elective CS courses, 
and concludes with guidelines for con- 
sidering integrating computer game 
content into existing CS classes. 


eval U ate d 5 fro mt h e Games and CS Classes 
ion, to the fundamental There are many types of games that 


he curriculum design. One of 


a_http://www.cwsei.ube.ca/departments/physics- 
astro courses.htm (Nov. 2007 update). 
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are suitable for teaching CS subjects 
including many noncomputer games 
or games that are based on dedicated 
devices (for example, Lego robots). Our 
focus is on interactive graphical com- 


puter games. It is important to recog- | 


nize that in the CS education arena the 
term “computer game” is often used to 


refer to the attempts at, and the results | 
of, effective and meaningful integra- | 
tion of animated graphical visualization | 
and various degrees of interactivity. Be- | 


cause of the unknown entertainment 
value, strictly speaking, much of these 


are interesting and innovative teaching | 


materials and are not computer games 
in acommercial sense. 

As discussed in Sung et al.,*? when 
examining recent efforts in integrating 
computer games into CS classes, we 
observe three general categories. 

1. Game development classes. These 
are entire curricula,’””’ individual 
classes,"°"3'%° or capstone projects””” 
designed specifically to develop new 
games as an end product. The educa- 
tion committee of the main profession- 
al organization for the games industry, 
The International Game Developer As- 
sociation (IGDA), has proposed a com- 
prehensive curriculum framework 
conveying the industry’s articulated 


desires for well-trained college gradu- | 


ates seeking jobs in the game industry. 
When evaluated against the IGDA cur- 
riculum framework, we see that these 
classes cover all the major core topic 
areas. Students in these classes must 
be concerned with all aspects of pro- 


ducing a real game including enter- | 


tainment value, visual quality, audio 
effects, physics simulations, and real- 
time performance. 

2.Game programming classes. 
These are classes (for example, Kuff- 
ner’s CMU course*’) designed specifi- 
cally to study technical aspects and is- 
sues involved in building games. For 
example, topics covered may include 


event loops, path planning algorithms, | 


and terrain representation. These 
classes typically do not require building 
an end product and the topics covered 


are general and usually are applicable | 


to different domains. These classes 
concentrate on covering the game pro- 


gramming topic area in the IGDA cur- | 


riculum framework. 
3. Game development client. These 
are existing CS classes that creatively in- 
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tegrate games into their existing curric- 
ulum. Typically, games are used as pro- 
gramming —assignments,'992775333437 
or to teach abstract concepts,!!1>°° 
or as an example application area to 
teach the concepts involved in an en- 
tire topic area.’ These are traditional 
CS classes that exist independent of 


game programming. These classes are | 


actually clients of game development 
where they use game development as a 
vehicle to deliver specific abstract con- 
cepts. After these classes, students are 
expected to understand the abstract CS 
concepts, and not the details of game 
development. 

Courses in the first two categories are 
new courses designed to teach students 
about game development. Over time, as 
the game development field matures, it 
is expected that these courses will evolve 
and eventually some of the contents 
will become part of the standard CS 


curriculum. This is not unlike the early | 


years of many existing disciplines in CS 
(for example, software engineering’ or 
computer graphics*), where the syllabi 
of pioneering courses consolidated as 
the disciplines mature. Courses in the 
third category, the “game development 
clients,” are traditional CS courses that 
can be found in existing CS curricula. 
The earliest work in this area!” adapt- 
ed games almost anecdotally without 
holistic considerations; most of the 
more recent work is structured around 
addressing core competency areas with 
reference to the ACM Curriculum. Ac- 
cordingly, courses in the “game devel- 
opment clients” category can be divid- 
ed into two broad efforts: introductory 
programming classes (CS1/2) and ad- 
vanced/elective classes. 


Games and Introductory 
Programming Classes 
Many CS educators recognized and 


effort required by faculty, existing work | 


done in this area can be classified into 
three broad approaches: 

Little or no game programming.” In 
these courses students learn by playing 
custom games but they do not actually 
program the games. 

Per-assignment game  develop- 
ment.>?!?5°8 All these classes devel- 
oped games as part of individual pro- 
gramming assignments. In each case, 
isolated games are designed around 
technical topics being studied. 


Extensive game development. For ex- | 


ample, faculty must design program- 
ming assignments based on custom 
library, general game engines,’ dedi- 


cated game engines,” specialized pro- _ 


gramming environments,” custom 
object-oriented class hierarchies,” spe- 
cific curricula,” or new programming 
languages." 

Much of this work reported re- 
sounding successes with drastically in- 
creased enrollments and student suc- 
cesses.*''”5 Based on these results, it is 
well recognized that integrating com- 
puter gaming into CS1 and C82 (CS1/2) 
courses, the first programming courses 
students encounter, is a promising 
strategy for recruiting and retaining po- 


tential students. With the enrollment | 


challenges faced by the CS discipline, 
it is desirable and important that this 
strategy can be adopted widely by all in- 
terested faculty and departments. 
However, most of the existing work 
in this area is based on pioneering ex- 
ploratory projects by faculty members 
with expertise in computer graphics 
and gaming.*”8 With few exceptions, 
these projects are student-centric where 
the main goals of study are student 
engagement and various learning out- 


| comes. Adaptability and generality of 


took advantage of younger generations’ | 
familiarity and interests for computer | 


video games and integrate related con- 
tents into their introductory program- 
ming courses. Because these are the 
first courses students encounter, they 
build excitement and enthusiasm for 
our discipline.*"” Based on the type of 


b It is important to reiterate that, after these 
classes students are expected to understand 
abstract programming concepts rather than 
concepts specific to building games. 
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the resulting materials are usually not 
main concerns. For the faculty mem- 
bers teaching CS1/2 courses, most of 


tees to arrive at consensus for signifi- 
cant modifications to CS1/2 courses, 
especially if the modifications involve 
computer games. For these reasons, 
to be widely adaptable, game-related 
CS1/2 materials should be designed 
with the following considerations: 

1. The materials should not demand 
knowledge in computer games or 
graphics. 

2. The materials should include in- 
dependent modules that are limited in 
curriculum scope. 

3. The materials should support se- 
lective experimentation by individual 
faculty members in small-scale pilot 
demonstration projects in their exist- 
ing courses. 

Selective Gradual Adoption. Results 
from the extensive game development 
approach discussed previously typi- 
cally include large amounts of adopt- 
able/adaptable courseware materials. 
However, using these materials often 
requires a significant investment of 
time, for example, understanding a 
game engine, or significant reworking 
of an instructor’s existing curriculum. 
Because of the considerable overhead, 
results from this approach are typically 
not suitable for selective adoption. 

In terms of suitability for selective 
adoption, we expect that results from 
the per-assignment game development 
approach will be most applicable. For 
example, one could selectively replace 
nongame assignments in existing 
classes by the corresponding games 


| assignments. However, because of the 


which are without computer graphics or | 


gaming background, it can be challeng- 
ing to take advantage of these results. 
In addition, when considering ex- 
perimentation with CS1/2 courses, it is 
important to appreciate institutional 
oversight procedures. Though becom- 
ing less controversial in recent years, 
many CS educators continue to be un- 


sure about integrating gaming in for- | 


mal educational settings.” It can be 


challenging in departmental commit- | 
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pioneering nature of work in this area, 


many of the results on per-assignment 
game developmentare “anecdotal” and 
do not discuss the impact of such as- 
signments on the CS1/2 curriculum ho- 
listically. For example, the results from 
Huang only involve turn-based strate- 
gic games,” Ross only discusses puzzle 
games,” and the discussion from Val- 
entine is based on a single game.** 

The Game-Themed Introductory Pro- 


| gramming Project at the University of 


Washington, Bothell‘ is specifically de- 
signed to address these issues. In the 
first phase of our project, we have de- 
signed and built general game-themed 
CS1/2 programming assignment mod- 
ules that demand no existing knowl- 


c_ http://depts.washington.edu/emmr/Research/ 
XNA Games/index.php 


edge of games or graphics from the 
faculty,** and have demonstrated it re- 
quires minimum changes to existing 


classes in order to successfully adopt | 
these materials. Currently in the sec- | 


ond phase of our project, we are build- 
ing game-themed examples and _ tu- 
torials designed to provide a pathway 
for interested faculty to gradually in- 
corporate game-related materials into 
their existing courses. Our project is 
student-centric because our materials 
allow students to practice CS concepts 


ina more real-world-like context. More | 


importantly, the materials are also fac- 
ulty-centric because these materials are 
the stepping-stones for faculty to begin 
experimenting with a promising new 
approach to teaching CS1/2 courses. 


Games and Elective CS Courses 

As highlighted earlier, the CS education 
community has a sound understand- 
ing of how to integrate visualization 
and interactivity in delivering CS1/2 
content and has achieved impressive 
successes. In comparison, there is a 
relatively modest amount of work done 
in integrating computer games into 
existing traditional CS elective classes. 
This is not surprising as a successful 
systematic integration requires the de- 
livery of an entire technical topic area 
to lend itself well in visualization and 


While it is the 

case that proper 
integration of game 
development and 
game content in CS 
classes have the 
potential to further 


engage students 
_resulting in higher 
success rates, it is 


interactivity. There are anecdotal ex- | 


amples of using game content in deliv- 
ering selective topic areas (for example, 


design patterns,'*’’ or spatial search | 


algorithms’). These are small-scale 
projects not meant to address entire 


courses as identified in the standard | 


CS curriculum. 


Artificial intelligence (AI),’ software _ 


engineering (SE),'°*’ and computer 


graphics (CG)*° are examples of elec- | 


tive courses where published results 
describe attempts at systematically in- 
tegrating game development. In all of 
these cases, the stated student learning 
outcomes are similar to those from the 
typical CS curriculum and do not in- 
clude competencies involved in game 
development as defined by the IGDA 


curriculum framework. In these class- | 


es, students study the core topic areas 
and implement games to demonstrate 
their understanding of the fundamen- 
tal CS concepts. This work reported 
high student engagement and enthusi- 


asm, while pointing out that the faculty | 


not the case that 
any game content 
will result in having 
a positive impact. 
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members involved must develop large 


- amounts of software infrastructure to 
_ facilitate and support students’ game 


development. 

Notice that all three of these topic ar- 
eas have significant overlaps with com- 
puter games in general: intelligent be- 
havior (AI) is one of the most important 
attributes of modern games, SE meth- 
odologies are applicable in any soft- 
ware product development, and topics 
in CG are the conceptual framework for 
visualization in games. One can argue 
that for these topic areas, it is relatively 
straightforward to integrate game con- 
tent in a consistent manner. In general, 
for topic areas that do not offer obvious 
overlaps with computer games or game 
development (for example, compiler or 
programming languages), dedication 
and creativity would be required to de- 
velop the elaborate infrastructure and 


_ to systematically integrate the new con- 


tents. In these cases, one should care- 
fully examine the trade-offs between 
required efforts, expected benefits, and 
consider other perhaps more appro- 
priate practical contexts (for example, 
popular applications on the Internet). 


Guidelines for Consideration 

While it is the case that proper integra- 
tion of game development and game 
content in CS classes have the potential 
to further engage students resulting in 
higher success rates, it is not the case 
that any game content will result in 
having a positive impact. In addition, 
when exploring the potential for devel- 
opment or adoption of game content, 


- we must work within the bounds of in- 


stitutional oversights and be conscious 
about the expertise areas of faculty 
members. The following are some fac- 
tors for consideration: 

> Institutional oversight. Depart- 
mental committee consensus is often 
required for significant changes to core 
courses. Because of the potential im- 
pact, it can be especially challenging to 
arrive at a consensus for modifications 
to introductory-level courses like CS1/2. 
A strategy is to design limited-curricu- 
lum-scope experiments to gain experi- 
ence (and collect results) to assist the 
committee’s decision-making process. 

> Faculty background. Many faculty 
members did not grow up playing com- 
puter games and most are not familiar 


| with graphics programming. When 
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developing or evaluating materials for 
adoption, it is essential to pay attention 
to the prerequisite knowledge. An ideal 
approach would be to clearly separate 
and hide the graphical and user inter- 
activity functionality. In this way, fac- 
ulty and students only need to concen- 
trate on the core CS concepts. 

» Gender and expertise neutrality. As 
with any powerful tool, inappropriate 
use of games can backfire and result in 
further alienation of underrepresented 
groups.” It is important that the gam- 
ing materials are gender and expertise 
neutral. For example, it is important to 
avoid violence and unnecessary com- 
petitions.*° The materials used should 
discourage the addition of superfluous 
“eye-candy” graphical enhancements, 
or user interaction programming by 
students with extensive prior program- 
ming experience. Doing so will help to 
avoid intimidating other less-experi- 
enced students. 

» Infrastructure support. Free and 
simple are the keywords here. Given 
the financial reality of most schools, 
all materials must be freely available; 
the associated institutional infrastruc- 
ture requirements must be modest and 
straightforward. 

» Conceptual integrity. Our discus- 
sion focuses on the traditional CS cours- 
es. It is important to remember that 
ultimately, the goal is to facilitate stu- 
dents’ learning about the core CS con- 
cepts. Any dilution, even in favor of ac- 
quiescing to some students’ desire and 
motivation to become game developers, 
would do the students a disservice. 

» Textbook availability. As in all 
pioneering work, mature and well- 
organized materials are mostly under 
development. Although there are some 
textbooks available for specific ap- 
proaches (for example, CS1/2,'' com- 
puter graphics*’), mostly, one must be 
ready to develop custom reading mate- 
rial to guide students along. 
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THERE ARE MANY who believe we are 
on the verge of the biggest change in 
the way products are made since the 
Industrial Revolution kicked into high 
gear over 150 years ago. This is not a 
revolution where factories will make 
products faster or more efficiently, as 
with the earlier manufacturing revolu- 
tion; it is one where individuals will be 
in control of personal or customized 
production. This idea, often referred to 
as personal fabrication,’ is that people 
will have the means at home and in 
their offices to “print” new physical ob- 
jects to their exact specifications. This 
is no longer simply a far-fetched idea 
of science fiction. The ever-decreasing 
price of 3D printers as well as the dem- 
onstration of printable plastic elec- 
tronics and other such fabrication ma- 
chines leads one to see that personal 
fabrication is not far off. 

Some might question whether in- 
dividuals will want to design and pro- 
duce new physical objects or even cus- 
tomize existing objects. Although it is 
doubtful we will use these technolo- 
gies for all of the physical objects we 
own and use, the proliferation of cus- 
tomized t-shirts, shoes, posters, and 
other objects available via Web-based 
custom product shops gives credence 
to the idea that a large segment of the 
population will be interested in creat- 
ing objects customized to their specifi- 
cations. Whether personalized objects 
will come through these Web-based 
services or be created right at home 
on 3D fabrication devices is a question 
only time will answer. 

One major problem often ignored 


by this view of the future is how the in- | 


dividual will specify the design of the 
objects they wish to create with these 
technologies. It is one thing to stipu- 
late a phrase to go on a t-shirt in one of 
a large set of predefined layouts, colors, 
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and font families, and it is another to 
define the exact shape, color, patterns, 
and working mechanisms of a 3D part 
or a complete product. Training most 
of the public in the skills of industrial 
design is simply not viable. 

Design tools that can be used by 
everyday users to create customized 
products is the solution to the speci- 
fication problem in personalized 
fabrication. In fact, it is probably the 
most important technical problem 
that must be solved before the vision 
of personal fabrication can come into 
being. This is why the work of Igarashi 
and Igarashi described in the fol- 
lowing article is so important to the 
future—not only for computing, but 
for production. Many in the research 
community have worked to bring pow- 
erful, sketch-based design tools to 
non-computing experts, for example, 
architects,’ students,''° musicians,*° 
interface designers,”'' multimedia au- 
thors,’ and amateur animators.’ Some 
of the most innovative work in this 
area has been accomplished over the 
last 10 years by the University of Tokyo 
group led by Takeo Igarashi. The work 
has led the field in the design of 3D ob- 
jects by non-experts by creating design 
tools that allow a user to simply sketch 
a 2D representation and then have the 
system carry out the difficult underly- 
ing computation to build a reasonable 
3D model.® 


instructions to assemble the requisite 
pieces to create the finished product. 
The system described here is tech- 
nically sophisticated. By making the 
modeling and simulation phases con- 
current, the underlying model is always 
maintained as a realizable product. 


| This sophisticated computing makes 


At first read, this paper on designing | 


plush toys may seem to address a nar- 
row portion of the family of 3D objects 


that users will want to fabricate, but 


these plush toys share many of the im- 
portant characteristics of other com- 
plex design objects: different patterns/ 
colors for different parts, a non-rigid 
filling that can change the shape of 
the object depending on how the out- 
side fabric is sewn, the production of 
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it much easier for the designer, as they 
will never get into the trap of creating 
something that simply cannot be built. 
This paper is the first of what I hope 
will be a series of design tools that will 
one day allow the public to realize their 
own personally designed products. @ 
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Abstract 


Figure 1. Overview of Plushie system. 


We introduce Plushie, an interactive system that allows non- | 


professional users to design their own original plush toys. | 


To design a plush toy, one needs to construct an appropriate 
two-dimensional (2D) pattern. However, it is difficult for non- 
professional users to appropriately design a 2D pattern. Some 
recent systems automatically generate a 2D pattern for a given 
three-dimensional (3D) model, but constructing a 3D model 
is itself a challenge. Furthermore, an arbitrary 3D model can- 
not necessarily be realized as a real plush toy, and the final 
sewn result can be very different from the original 3D model. 
We avoid this mismatch by constructing appropriate 2D pat- 
terns and applying simple physical simulation to it on the fly 
during 3D modeling. In this way, the model on the screen is 
always a good approximation of the final sewn result, which 
makes the design process much more efficient. We use a 
sketching interface for 3D modeling and also provide various 
editing operations tailored for plush-toy design. Internally, 
the system constructs a 2D cloth pattern in such a way that 
the simulation result matches the user’s input stroke. We 
successfully demonstrated that nonprofessional users could 
design plush toys or balloon easily using Plushie. 


1. INTRODUCTION 
A computer can be a powerful tool for designing real- 
world objects. One can build a virtual three-dimensional 
(3D) model on a computer using computer-aided design 
(CAD) and use the model to run various simulations with 
computer-aided engineering (CAE) without the need 
to build or damage costly real objects. The benefits are 
evident in many areas from architecture to automobile 
design. However, these tools are mainly designed for pro- 
fessional users and are not particularly accessible to the 
ordinary person. The construction of a 3D model using a 
standard CAD system is tedious, and running a physical 
simulation using a standard CAE system requires a certain 
level of expertise. 

Our goal is to bring the benefits of CAD and CAE to the 
hands of nonprofessional users including children. This 
article introduces our plush-toy design system,'® Plushie, as 
an example of our efforts to achieve this goal. Plush toys are 
familiar objects in our daily lives, but their design is diffi- 
cult. One must design an appropriate two-dimensional (2D) 
pattern to obtain a particular 3D shape, but the relation- 
ship between the two is nontrivial, and intensive experience 
and knowledge are required to achieve satisfactory results. 
As a result, most people simply buy ready-made plush toys 
and do not enjoy the design and construction of their own. 


We have provided a way for people to design their own toys 
using a simple but powerful modeling tool that tightly inte- 
| grates a sketching interface with physical simulation in the 
| modeling process. 

Plushie allows the user to design a plush toy from scratch 
by simple sketching operations.'* The user first draws the 
desired silhouette, and the system automatically generates 
a 3D plush-toy model and corresponding 2D cloth pattern. 
The user can also edit the model, e.g., cut it or add a part, 
using a simple sketching interface, and the 3D model and 
2D cloth pattern are automatically updated. The 3D model 
is the result of a physical simulation that mimics the infla- 
tion of the sewn 2D cloth patch. Therefore, the model on 
the screen is always a good estimate of the final sewn result 
(Figure 1). When we ran workshops in a museum to have 
novice users try our system, we observed that even children 
could design their own plush toys. 

We first give an overview of sketching interfaces for 3D 
modeling and previous efforts to enable end users to design 
physical objects. We then describe the user interface and 
implementation of the Plushie system, followed by results 
and user experiences. Finally, we conclude the article with 
some discussion of future work. 


2. SKETCHING INTERFACES FOR 3D MODELING 

The sketching interface part of Plushie is an evolution of the 
Teddy system we presented in 1999.'' That system allowed the 
user to create an interesting 3D model simply by sketching a 
silhouette of the target model (Figure 2, left). It was designed 
for the modeling of free-form rotund models, a task that is 
particularly difficult using standard modeling interfaces. 
Figure 3 shows an example of a modeling sequence using 
Teddy. The user’s strokes are shown in red, and everything 
else is inferred and drawn by the system. The user first draws 


A previous version of this paper appeared in Proceedings 
of SIGGRAPH 2007/ACM Transactions on Graphics 26, 
3 (2007), 45. 
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Figure 2. Screenshot of Teddy and sample 3D models created using 
the system. 


IEE EE 2 ET SS EE SLY | IE RP TE TT 
Figure 3. Modeling session in Teddy. The user can create a 3D model 
using simple sketching operations. 


the silhouette of the base primitive, and the system gener- 
ates the corresponding 3D geometry. The user then draws 
a stroke across the model and the system cuts the model at 
the line. The user can also add parts to the base model by 
drawing two strokes. Figure 2 right shows several 3D models 
created using the system. 


Several sketching systems for free-form shapeswere devel- 


oped after Teddy. The original Teddy system used polygonal 
meshes, but some later systems experimented with other 
representations such as voxels*® and implicit surfaces.’ 
Some systems extended the interface to support subsequent 
editing by direct manipulation. ShapeShop*' represents 
a model as a collection of blob primitives and allows the 
user to move or scale each primitive. Fibermesh”’ keeps the 
original stroke as a control curve on the model surface and 
allows the user to adjust the shape by deforming the curves. 
However, all these systems are designed for purely virtual 3D 
models without consideration of the physical properties of 
materials. Plushie is innovative in that it shows the feasibil- 
ity of using a sketching interface for free-form shapes in the 
design of physical objects. 


3. DESIGNING PHYSICAL MODELS WITH 
A COMPUTER 


Another key aspect of our work is the tight integration of | 
physical simulation into the 3D modeling process. In tra- | 
ditional applications, modeling and simulation are com- | 
pletely separate. A virtual model is created in 3D modeling | 


software without considering any physical constraints, 
and it is then passed to a simulation environment. If the 
simulation result reveals a problem, the user returns to 
the model to fix the problem. We made this process more 
efficient by running the simulation concurrently with the 
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modeling to create only models that are physically realiz- 
able. In this way, the user can more efficiently explore the 
design dimensions within realistic constraints. From the 
user’s point of view, the model generated by the system 
may not correspond exactly to the shape that was input, 
but it will be a physically realizable shape reflecting the 
input shape. 

Some recent systems have tried to incorporate fast physi- 
cal simulation into an interactive design process. Igarashi 
and Hughes developed a mark-based interface for putting 
clothing on a virtual character,'? and Decaudin et al. pro- 
posed a system for designing an original garment via sketch- 
ing.° Both used simple geometric simulations to represent 
the physical properties of cloth material. Masry and Lipson 
described a system in which the user can quickly build a CAD 
model by sketching and immediately apply a finite element 
analysis to the model.'* However, the model construction is 
computed before the simulation in these systems, and no 
dynamic feedback loop exists between the simulation result 
and the original user input. 

Several efforts have also been made to support the design 
of physical objects by end users in the computer graphics 
research community. Mitani and Suzuki'® and Shatz et al.” 
presented automatic segmentation of a 3D model into sur- 
face patches that can be perfectly flattened onto a plane 
without distortion for constructing paper craft models. 
Similarly, Julius et al. proposed similar method for plush 
toys" allowing small distortion. Pillow system’ facilitates 
the manual segmentation of a model by providing automatic 
flattening and by showing the result of physical simulation. 
These systems make plush-toy design more accessible, but 
the fundamental challenge of creating an original plush toy 
is still unresolved. 


4. THE PLUSHIE SYSTEM 

The system consists of two windows: one shows the 3D 
plush-toy model being constructed and the other shows 
the corresponding 2D pattern (Figure 4). The user works on 
the 3D view, interactively building the 3D model by using a 
sketching interface. The 2D view is mainly for reference but 
the user can also edit the 2D pattern directly when desired. 
The 3D model is produced from a physical simulation of 
the assembled 2D pattern. After each input from the user, 


Figure 4. A screen snapshot of the Plushie system. 
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the system updates the 2D pattern so that the simulation 
result matches the user input. This guarantees that the 
model is always realizable as a real plush toy and that the 
2D pattern is readily usable as a template for cutting and 
sewing real fabric. 

The modeling operations are based on the Teddy sys- 
tem." The user interactively draws free-form strokes on the 
canvas as gestures and the system automatically generates | 
a 3D model and corresponding 2D cloth pattern. We also | 
provide some special editing operations tailored for plush- | 
toy design. 

Creating a New Model: Starting with a blank canvas, the user 
creates a new plush-toy model by drawing its silhouette as a 
closed free-form stroke. The system automatically generates 
two cloth patches corresponding to the stroke and visual- 
izes the shape of the resulting plush toy by applying a simple 
physical simulation (Figure 5). 

Cut: The user can cut the model by drawing a stroke that 
starts outside of the model, crosses it, and ends outside of 
the model (Figure 6). The model is cut at the intersection and 
flat patch is generated at the cross-section. This operation is 
useful for creating relatively flat surfaces, such as those ina 
foot or belly. 

Creation of a Part: The user can add protruding parts such — 
as the ears and arms to the base model by drawing a single 
stroke that defines the silhouette of the part. The stroke 
should start and end on the base model (Figure 7a). The sys- 
tem generates two candidate shapes and presents them to 
the user as suggestions?’ (Figure 7b). One is for fat, rounded 
parts like the body, arm, and leg (Figure 7c). Their base is 
connected to the base model with an open hole. The other 
candidate shape is for thin parts like ears and the tail 
whose base is closed (Figure 7d). The user clicks the desired 


Figure 5. Creating a new model. 


CS? 


Figure 6. Cut operation. 


Figure 7. User interface of part creation. (a) The user draws a stroke 
and (b) the system suggests two different possibilities. The user 
chooses one (c, d). 


thumbnail and the system updates the main model accord- 
ingly. We found that the ability to create thin parts with a 
single stroke is particularly useful. They are frequently seen 
in real toys and are difficult to design using standard model- 
ing software. Figure 20 shows a couple of example models 
with thin parts. 

Pull: The user can grab a seam line and pull it to modify 
the shape. For example, the user can pull an ear to make it 
larger when it is smaller than the other (Figure 8). The pull- 
ing operation begins when the user starts dragging on the 


| background region near a seam line. The system changes 


the mouse cursor when it approaches a seam line to indi- 
cate that the user can start pulling. We use the peeling inter- 
face introduced by Igarashi et al.’’ to adjust the size of the 
region to be deformed, that is, the larger area is deformed 
as the user pulls more. The system continuously updates the 
2D cloth pattern during pulling and shows the simulation 
result in the 3D view. 

Insertion and Deletion of Seam Lines: The modeling opera- 
tions performed thus far automatically generate 2D patches 


_ according to predefined algorithms and seam lines (patch 


boundaries) appear on the 3D model surface without the 
user’s explicit control. However, it is sometimes desir- 
able for knowledgeable users to design seam lines manu- 
ally, for more detailed control. This is especially important 
when using nonstretchy cloth as in balloon models because 
one needs to divide a rounded surface into many almost- 
developable small patches (Figure 19 bottom). 

The user can add a new seam in the seam line drawing 
mode by drawing a free-form stroke on the model surface 
(Figure 9). The corresponding cloth patch is then auto- 
matically cut along the new seam line. If the stroke crosses 
the entire patch, the patch is divided into two separate 
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Figure 9. Insertion of a seam line. (a) Before drawing a line, (b) after 
drawing a line, (c) the seam line’s two endpoints snap at other seam 
lines, and (d) After pulling the seam line. 


Figure 11. Patches connected to each other using connectors (b) and 
numbers (c). 


patches. If the stroke starts or ends in the middle of a 
patch, it becomes a dart. The 3D geometry does not change | 
immediately after the insertion of these seam lines, but the 
user can pull the seam line afterwards to modify the shape. 
This operation is very useful for creating a salient feature 
in the middle of a flat patch. Deletion is achieved by trac- 
ing the target seam line in the erasing mode. This merges 
the separated patches together and thus flattens the area 
(Figure 10). 

Operations on the 2D Pattern View: The 2D pattern view is 
mainly used to preview the pattern to be printed for sewing, 
but it also works as an interface for advanced users to edit the | 
pattern directly. The preview helps the user to understand 
the relationship between the 3D model and 2D patches and — 
gives a sense of the labor required for assembling the patches. | 
The system can display how patches are connected by show- 
ing connectors or paired numbers (Figure 11). Connectors 
are useful for understanding the relationship on the screen 
and numbers are useful as a printed reference on each 


84 COMMUNICATIONS OF THE ACM = DECEMBER 2009 VOL. 52 | NO. 12 


Figure 12. Pulling a 2D patch. 


patch. The system provides an automatic layout and manual 
arrangement interface for preparing the final pattern to be 
printed. 

The system also allows the user to edit the patches directly 
by using the pulling interface. The user can grab the bound- 
ary of a patch and pull it to deform the shape.'’ We again use 


| a peeling interface to adjust the size of area to be deformed. 


The effect of 2D deformation immediately appears in the 


| 3D view because of the physical simulation. The ability to 


deform an individual patch is useful for designing asym- 
metric shapes such as a penguin belly (Figure 12). The pull 


_ operation is also useful for opening a dart line to make a flat 


patch swell more (Figure 13). 


5. IMPLEMENTATION 

This section briefly describes the implementation of 
Plushie. A more detailed description is found in our original 
paper.'® We use a standard triangle mesh for the represen- 
tation of the 3D model and 2D patches. We use a relatively 
coarse mesh (1000-2000 vertices) to achieve interactive 
performance. Each vertex, edge, and face of the 3D mesh 
is associated with corresponding entities in the 2D mesh. 
A 3D mesh is always given as a result of applying a physical 
simulation to the assembled 2D pattern. To be more pre- 
cise, the physical simulation applied to the 3D mesh is gov- 
erned by the rest length of each edge and the rest length is 
defined in the 2D mesh. For each modeling operation, the 
system constructs the initial 2D patches and the 3D geom- 
etry corresponding to the input stroke, and then runs a 
physical simulation to update the 3D geometry. The system 
then adjusts the patch shape so that the simulation results 
match the input strokes. 


5.1. Physical simulation 

We use a simple static method for the physical simulation. 
We examined other, more elaborate methods, such as 
finite element methods,* dynamic simulation,°® and energy 
minimization,’ but we found that the simple approach is 


Figure 14. Our simple model to mimic stuffing effect. Internal 
pressure pushes vertices outwards (a) and edge springs pull 
them back (b). 


best suited for our purpose. It is easy to implement, fast 
enough for interactive modeling, and sufficiently robust 
for dealing with adverse user operations. More impor- 
tantly, it produces a reasonable estimation of the result- 
ing plush-toy shape. As shown in Figure 14, it successfully 
reproduces characteristic behaviors seen in the stuffed 


cloth. This algorithm is also used in a garment capture | 


system.’ 

In each simulation cycle, the system first moves each 
face slightly in its normal direction to mimic the effect of 
internal pressure (Figure 14a). The system then adjusts the 
length of each edge to preserve the integrity of the cloth 
material> (Figure 14b). We decided to prevent stretching 
only and tolerate compression because plush toys’ rotund 
shape is generated from compression (small wrinkles) 
along the seam lines. The second part (adjustment of edge 
length) runs 10 times in each cycle to prevent excessive 
stretch. It takes ~30 simulation cycles (2s) to converge in 
our typical examples. Although it is possible to show the 
result only after the convergence, we decided to show the 
intermediate shape because test users preferred to see the 
inflation process. 


5.2. 3D modeling operations 

Creating a New Model: The input stroke is projected onto an 
invisible plane at the center of the world facing the screen, 
and the system generates an initial two-sided mesh inside of 
the closed region. Each side of the mesh is used directly as a 
2D patch for the model. The system then applies the physical 
simulation to the mesh. It inflates the mesh to the direction 
perpendicular to the viewing direction, but its silhouette 
actually becomes smaller as it inflates (Figure 15). The sys- 
tem waits until the simulation converges and then starts to 
adjust the 2D pattern so that the simulation result matches 
the input stroke. Specifically, the system calculates the dis- 
tance d, from a vertex v, of the 3D mesh along the seam line 
to the corresponding point p, in the projected input stroke 


along its normal direction, and moves the corresponding — 
2D vertex uw, on the patch boundary to its normal direction | 


by that amount d,. After modifying the patch boundary, the 
system updates the 2D mesh so that vertices are uniformly 
distributed inside of the patch. The length of the edges in 
the updated 2D mesh is then used as the new rest length 
in the simulation. The system repeats this adjustment pro- 
cess and the physical simulation until convergence. It takes 
approximately 20 iterations (2 seconds) to converge in our 
typical examples. 


Figure 15. Adjustment process after creation. The system enlarges 
the 2D pattern so that the simulation result matches the input 
stroke. The 2D boundary vertex (v) moves in its normal direction by 
the amount proportional to the distance between the corresponding 
3D vertex and the input stroke. 


This simple algorithm works well in practice for our 

application. Figure 16 shows some examples in which our 
algorithm successfully found appropriate 2D patches that 
yielded the desired 3D shapes. In some situations, the 
input shape is not realizable as a plush-toy model con- 
sisting of two patches. For example, a sharp concavity is 
not realizable without causing self-intersection in the 2D 
patch. In these cases, the system terminates the optimiza- 
tion process, leaving a gap between the input stroke and 
_ the 3D model. This indicates that the desired shape is not 
possible with two patches. The user must add additional 
seam lines to obtain more control. 
Cut: The system constructs a curved surface by sweep- 
ing the cutting stroke on the screen to the viewing 
direction and dividing the mesh along the surface. The 
right-hand side of the surface is removed and a new mesh 
is created on the cross-section. The cross-section is always 
developable, so the system simply flattens it and uses it as 
a 2D patch. The system then applies the inflation process 
to the model. Note that the silhouette of the inflated 3D 
model does not exactly matches with the input cut stroke 
because we do not apply any adjustment as in the initial 
creation. 


TS ES SES TE PE LEE II EO TT IE LETTE 
Figure 16. Physical simulation and shape adjustment. The red 


_ lines indicate the input strokes. The top row shows the result 

of converting the input into patterns directly, and the bottom 

row shows the outcome when the adjustment process is applied 

to the patterns. The green shapes in the middle show the simulation 
results and the brown ones on the right show the real fabric 
models, both resulting from the 2D pattern on the left. 
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Figure 17. (a) Creation of a part. The system projects the input stroke 


to a working plane and cuts the base mesh with either an elliptic 
curve or a line (b). The 3D geometry is constructed by creating a 
mesh between the projected stroke and the base curves (c). 


(a) 


Creation of a Part: The system first projects the two end- 
points of the input stroke onto the base model surface. 
A plane that passes through these 3D points and is facing 
toward the screen is constructed and the input stroke is 
projected onto it. The system then draws an ellipse on the 
model surface for constructing a fat part and draws a line for 
creating a thin part (Figure 17). The ellipse or the line (what 


we call base curves) is also projected to the plane. The sys- | 


tem generates a 2D mesh on the projection plane in the area | 


enclosed by the projected input stroke and the projected | 


base curve. The 2D mesh is duplicated and serves as 2D pat- 
tern and as the initial 3D geometry for the added part. As in 
the initial model creation case, the flat two-sided 3D mesh is 
inflated by physical simulation. The silhouette of the added 
part gradually shrinks and the system enlarges the 2D pat- 
tern so that the silhouette matches the input stroke as in 
initial creation. 

In case of a part with an elliptic base curve, the system 
cuts open the base surface and stitches it with the newly cre- 
ated mesh. The result is a single connected mesh, and phys- 
ical simulation is applied uniformly to the entire mesh. On 
the other hand, the system does not open the base mesh 
in case of the linear base curve. The new part is created as 


an independent closed mesh and the simulation is applied | 


separately to the base mesh and the new part. The base 


mesh inflates independently of the part mesh, and the base | 


curve is treated as a positional constraint in the simulation 
of the part mesh (we simply do not move these vertices in 
the simulation cycle). 

Pull: The pull operation is a bit involved because the sys- 
tem cannot directly modify the 3D mesh and must do so 
indirectly by deforming the corresponding 2D pattern. As 
the user starts pulling a vertex on a seam line, the system 
first constructs a projection plane that passes through 
the seam line (Figure 18). The mouse cursor position on 
the screen is projected onto the projection plane, and it is 
used as a target position for the pulled vertex during subse- 
quent dragging. The system computes the displacement 6, 
in the local coordinate frame on the projected plane from 
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Figure 18. Pulling a vertex on a seam line. 


the original position v, to the target position h,, and moves 
the corresponding vertices u° and u,' in the 2D mesh in 
their local coordinate frames by that amount 6. These 3D 
and 2D coordinate frames are defined by the pulled ver- 
tex’s normal vector and the direction of the seam line. The 
system iterates this displacement process with physical 
simulation until it converges. To achieve a smooth defor- 
mation, the system also moves the surrounding vertices 
in the 2D mesh using the curve manipulation method 
introduced in Igarashi et al.’ It enlarges the region to be 
deformed proportional to the displacement of the pulled 
vertex. 

Insertion and Deletion of Seam Lines: Insertion of a new 
seam line is straightforward. The system simply cuts the 
patch along the added seam line, and basically does not 
change the result of simulation. Deletion is more compli- 
cated because the merged patch is not necessarily devel- 
opable. The system applies an approximate flattening 
operation” to the merged 3D surface to obtain the geometry 
of the new 2D patch. 


6. RESULTS 

Plushie is implemented as a Java™ program. Construction 
of 2D patterns and a physical simulation run in real-time 
on a 1.1GHz Pentium M PC. We designed a couple of 
plush toys using our system and created a real toy based 


| on the printed pattern. A modeling session typically takes 


10-20 min and sewing takes 3-5 h. Figure 19 shows a plush 
toy and balloon model designed in our system. It shows 
that the physical simulation successfully captures the over- 
all shape of the real objects. We interviewed with profes- 
sional balloon designers and they supported our system, 
saying that it can significantly reduce the time necessary 
for designing original balloon. 

The user can assign different textures to individual 
patches (Figure 20). Therefore the user can explore various 
design possibilities before actuary sewing the real fabric 
(such as Figure 20 right). These models also demonstrate 
the effectiveness of thin parts. 

We ran four small workshops to test the usability of the 
system and found that novice users, mainly children, can 
successfully create original plush toys using our system. 
Here is an observation from one of these workshops. Nine 


Figure 19. A plush toy and a balloon designed in our system. 


Figure 20. Example of texture changed. These models have many 
thin parts. 


ccd 


Figure 21. Example of original plush toys designed and created by 
children in the workshop. 


children, approximately 10-14 years old, joined the work- 
shop accompanied by their parents. We gave a brief tuto- 
rial at the beginning and had them design their own plush 
toys using the system. It took about an hour for the design. 
They then printed the designed pattern and sewed a real 
toy in ~3h. Figure 21 shows a couple of plush toys created 
in the workshop. Participants quickly learned how to use 
the system and successfully designed the 3D models they 
wanted, with some help from volunteers. Furthermore, 


Figure 22. Knitty system allows the user to design an original knitted 
toy by simply drawing the desired silhouette. 


they enjoyed the process. These toys were their own cre- 
ations and one-of-a-kind designs. Participants also gave us 


_ valuable feedback for future improvement. They wanted to 
_ have some auxiliary functions such as the ability to design 
| symmetric parts and remove existing parts, but no one 


complained about the quality of the visual simulation. A 
perfectly accurate simulation is not necessary because 
many small variations inevitably occur during the real sew- 
ing and stuffing process. 


7. CONCLUSION 

We introduced a plush-toy design system as an example of 
our efforts to make CAD and CAE accessible to end users. 
The system allows the user to design a plush toy quickly and 
_ simply by combining simple sketching operations. The user 
draws the desired silhouette on the canvas, and the system 


_ automatically generates a 3D plush-toy model anda 2D cloth 


pattern. The system runs a simple physical simulation in the 
background so that the resulting 3D model is always a good 
estimate of the final sewn result. The user can construct a 
real plush toy by printing the pattern and sewing the result- 
ing pieces together. 

To demonstrate the effectiveness of the approach 
even further, we also developed a system for the design of 
knitted toys.'’ A knitted toy is a toy made of knitted yarn 
instead of cloth patches. One can construct a knitted 
toy by knitting according to a specific knitting pattern, 
but it is difficult to produce an appropriate knitting pat- 


| tern for a desired 3D shape. The Knitty system allows the 


user to design an original knitted toy by simply drawing the 


_ desired silhouette (Figure 22). The system then generates a 


3D knitted animal model and corresponding knitting dia- 
gram. We ran a workshop using this system and observed 
that children could use it to design their own knitted 
animals. 

Interactive 3D modeling assisted by concurrent physi- 
cal simulation can be a powerful tool in many other 
application domains. For example, if one could run an 
aerodynamic simulation during the interactive design 
of a paper airplane model, the entire geometry could be 
adjusted in an intelligent manner in response to the user’s 
simple deformation operations to produce a model that 
could actually fly. This kind of interaction would make it 
easier for designers to pursue aesthetic goals while satis- 
fying engineering constraints. Real-time simulation does 


_ require high-performance computing resources, but some 


meaningful support should be possible by carefully limit- 
ing the target task and designing appropriate interfaces as 
shown in this paper. We hope that our work inspires more 
work in this direction. a 
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A Graphical Sense of Touch 


By Pat Hanrahan 


ONE OF THE Major innovations in com- 
puting was the invention of the graph- 
ical user interface at MIT, SRI, and 
Xerox PARC. The combination of com- 
puter graphics hardware with a mouse 
and keyboard enabled a new class of 
highly interactive applications based 
on direct manipulation of on-screen 
displays. 

It is interesting to reflect on the rela- 
tive rate of advance of input and output 
technology since the very first systems. 
At the time, graphics hardware consist- 
ed of single-bit framebuffers output- 
ting to black-and-white displays. 

Moving forward, we now have | 
flat-panel, high-definition, full-color | 
displays driven by inexpensive, high- 
performance graphics chips capable 
of drawing three-dimensional virtual 
worlds in real time. 

Graphics hardware draws tens of mil- 
lions of polygons and tens of billions of 
pixels per second. In comparison, most 
personal computers ship with a mouse 
and keyboard similar to that used at Xe- 
rox PARC in the 1970s. 

The lack of progress in input tech- 
nology has caused computers to be- 
come sensory deprived. They can 
output incredible displays of infor- 
mation, but they receive almost no 
information from their surrounding 
environment. 

Contrast this situation to a living or- 
ganism. Most organisms have extraor- 
dinary abilities to sense their environ- 
ment, but limited ability to display 
information (except by movement; a 
few animals like the chameleon and 
the cuttlefish can change their skin 
color). Perhaps this explains why we 
enjoy interacting with our pets more 
than with our computers. 

Stuart Card, a Senior Research Fel- 
low at Xerox PARC, has observed that 
one of the breakthrough ideas in the 
graphical user interface was to am- 
plify input relative to output. One | 
mechanism is to enable input to beon | 
output. Examples include on-screen | 
buttons and menus. By leveraging | 


output technology, we augment lim- | 


ited input by providing context. An- 
other strategy for enhancing input is 
to use pattern recognition to extract 
as much information as possible from 
the stream of sensed data. 

Fortunately, this state of sensory de- 
privation is beginning to change. 

The biggest recent development is 
the commercial emergence of multi- 


_ touch displays. Traditional display in- 


put technology only returns a single X, 


Y position ata time. Asa result, the user _ 


can only point to a single location at a 
time and, consequently, use only one 


_ finger or one hand ata time. 


In a multitouch display, multiple 
points are sensed simultaneously. 
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sory modalities, including a micro- 
phone, camera, accelerometer, and a 
GPS receiver and compass. Relative to 
a modern desktop computer, it is sen- 
sory rich and output poor. 

The following paper by a team from 
Microsoft Research introduces a very 
novel way to build a multitouch in- 
terface—the ThinSight system. They 
modify a flat-panel display to sense 


| touch directly. Previously, touch was 


sensed indirectly; for example, by 
mounting a camera to look at the sur- 


_ face of the display. The camera-based 
| approaches require a fairly large space, 


This allows the application to sense | 


multiple fingers from both hands. 
This in turn makes it possible to rec- 
ognize finger gestures or coordinated 
two-handed motion. 

Successful commercial examples 
of multitouch displays include the Ap- 
ple iPhone and the Microsoft Surface. 
The iPhone has a unique user inter- 
face that is enabled by an embedded 
multitouch display. To zoom into a 
map, the user simply moves their fin- 
gers apart. Beyond touch, the iPhone 
has several additional built-in sen- 
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have problems with occlusion, and are 
difficult to calibrate. In the ThinSight 
system, the LED backlight that drives 
the display is modified to include in- 
frared LEDs and sensors interspersed 
amid the visible light emitters. 

The display surface can both emit 
light and sense position. Distribut- 
ing sensors throughout the display 
substrate yields a compact, efficient 
design. 

This paper is important because 
it proposes an innovative design that 
addresses a long-standing problem. 
However, there is much more work to 
do in this area. The authors have add- 
ed the sense of touch to the display, 
but like real touch sensors it has lim- 
ited resolution and the sensed object 
must be in contact with the display. In 
Pierre Wellner’s pioneering work on 
the digital desk, the computer sensed 
remote hand positions as well as the 
position, type, and content of objects 
on the desk. Unfortunately, Wellner’s 
system involved bulky cameras and 
projectors. 

Hopefully HCI researchers will ex- 
pand on the innovative sensing strategy 
proposed in this paper. We want com- 
pact interactive devices that have rich 
sensory capabilities including touch, 
sight, and hearing. 


Pat Hanrahan is the CANON Professor of Computer 
Science and Electrical Engineering in the Computer 
Graphics Laboratory at Stanford University, Stanford, CA. 
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ThinSight: A Thin Form-Factor 
Interactive Surface Technology 


By Shahram Izadi, Steve Hodges, Alex Butler, Darren West, Alban Rrustemi, Mike Molloy and William Buxton 


ABSTRACT 

ThinSight is a thin form-factor interactive surface technol- 
ogy based on optical sensors embedded inside a regular 
liquid crystal display (LCD). These augment the display 
with the ability to sense a variety of objects near the sur- 
face, including fingertips and hands, to enable multitouch 
interaction. Optical sensing also allows other physical items 
to be detected, allowing interactions using various tan- 
gible objects. A major advantage of ThinSight over existing 
camera and projector-based systems is its compact form- 
factor, making it easier to deploy in a variety of settings. We 
describe how the ThinSight hardware is embedded behind 
a regular LCD, allowing sensing without degradation of dis- 
play capability, and illustrate the capabilities of our system 
through a number of proof-of-concept hardware prototypes 
and applications. 


1. INTRODUCTION 
Touch input using a single point of contact with a display 
is a natural and established technique for human computer 
interaction. Research over the past decades,’ and more 
recently products such as the iPhone and Microsoft Surface, 
have shown the novel and exciting interaction techniques 
and applications possible if multiple simultaneous touch 
points can be detected. 

Various technologies have been proposed for multitouch 
sensing in this way, some of which extend to detection of 
physical objects in addition to fingertips. Systems based 
on optical sensing have proven to be particularly powerful 
in the richness of data captured and the flexibility they can 
provide. As yet, however, such optical systems have predom- 
inately been based on cameras and projectors and require 
a large optical path in front of or behind the display. This 
typically results in relatively bulky systems—something that 
can impact adoption in many real-world scenarios. While 
capacitive overlay technologies, such as those in the iPhone 
and the Dell XT Tablet PC, can support thin form-factor mul- 
titouch, they are limited to sensing only fingertips. 

ThinSight is a novel interactive surface technology which 
is based on optical sensors integrated into a thin form-factor 
LCD. It is capable of imaging multiple fingertips, whole 
hands, and other objects near the display surface as shown 
in Figure 1. The system is based upon custom hardware 
embedded behind an LCD, and uses infrared (IR) light for 
sensing without degradation of display capability. 

In this article we describe the ThinSight electronics and the 
modified LCD construction which results. We present two pro- 
totype systems we have developed: a multitouch laptop and a 

touch-and-tangible tabletop (both shown in Figure 1). These 
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systems generate rich sensor data which can be processed 
using established computer vision techniques to prototype a 
wide range of interactive surface applications. 

As shown in Figure 1, the shapes of many physical objects, 
including fingers, brushes, dials, and so forth, can be “seen” 
when they are near the display, allowing them to enhance 
multitouch interactions. Furthermore, ThinSight allows 
interactions close-up or at a distance using active IR point- 
ing devices, such as styluses, and enables IR-based commu- 
nication through the display with other electronic devices. 

We believe that ThinSight provides a glimpse of a future 
where display technologies such as LCDs and organic light 
emitting diodes (OLEDs) will cheaply incorporate optical 
sensing pixels alongside red, green and blue (RGB) pixels in 


(Se 9 eas ee ee RE SS cee ee LE EO SE a 
Figure 1. ThinSight brings the novel capabilities of surface computing 


to thin displays. Top left: photo manipulation using multiple fingers on 
a laptop prototype (note the screen has been reversed in the style of 

a Tablet PC). Top right: a hand, mobile phone, remote control and reel 
of tape placed on a tabletop ThinSight prototype, with corresponding 
sensor data far right. Note how all the objects are imaged through the 
display, potentially allowing not only multitouch but tangible input. 
Bottom left and right: an example of how such sensing can be used 

to support digital painting using multiple fingertips, a real brush and 
a tangible palette to change paint colors. 


Original versions of this paper appeared in Proceedings 
of the 2007 ACM Symposium on User Interface Software 
and Technology as “ThinSight: Versatile Multi-touch 
Sensing for Thin Form-factor Displays” and in 
Proceedings of the 2008 IEEE Workshop on Horizontal 
Interactive Human Computer Systems as “Experiences 
with Building a Thin Form-Factor Touch and Tangible 
Tabletop.” 


a similar manner, resulting in the widespread adoption of | 
such surface technologies. 


2.2. Further features of ThinSight 


| ThinSight is not limited to detecting fingertips in con- 


| tact with the display; any suitably reflective object will 


2. OVERVIEW OF OPERATION 


2.1. Imaging through an LCD using IR light 
A key element in the construction of ThinSight is a device 
known as a retro-reflective optosensor. This is a sensing ele- 
ment which contains two components: a light emitter and | 
an optically isolated light detector. It is therefore capable 
of both emitting light and, at the same time, detecting the | 
intensity of incident light. If a reflective object is placed in 
front of the optosensor, some of the emitted light will be 
reflected back and will therefore be detected. 
ThinSight is based around a 2D grid of retro-reflective 
optosensors which are placed behind an LCD panel. Each 
optosensor emits light that passes right through the entire 
panel. Any reflective object in front of the display (such as a | 
fingertip) will reflect a fraction of the light back, and this can 
be detected. Figure 2 depicts this arrangement. By using a suit- 
ably spaced grid of retro-reflective optosensors distributed uni- 
formly behind the display it is therefore possible to detect any 
number of fingertips on the display surface. The raw data gen- 
erated is essentially a low resolution grayscale “image” of what 
can be seen through the display, which can be processed using 
computer vision techniques to support touch and other input. 
A critical aspect of ThinSight is the use of retro-reflective 
sensors that operate in the infrared part of the spectrum, for 
three main reasons: 
~ Although IR light is attenuated by the layers in the LCD 
panel, some still passes through the display.° This is 
largely unaffected by the displayed image. 

~ A human fingertip typically reflects around 20% of inci- 
dent IR light and is therefore a quite passable “reflective 
object.” 

— IR light is not visible to the user, and therefore does not 
detract from the image being displayed on the panel. 


EST ES AI NEE ISS ELE ES SATE AEE, CIE DETTE GAS TELL, 
Figure 2. The basic principle of ThinSight. An array of retro-reflective 
optosensors is placed behind an LCD. Each of these contains two 
elements: an emitter which shines IR light through the panel; and 

a detector which picks up any light reflected by objects such as 
fingertips in front of the screen. 


LCD panel 


Emitter 


Detector 


Optosensor array 


cause IR light to reflect back and will therefore generate a 
“silhouette.” Not only can this be used to determine the loca- 
tion of the object on the display, but also its orientation and 
shape, within the limits of sensing resolution. Furthermore, 
the underside of an object may be augmented with a visual 
mark—a barcode of sorts—to aid identification. 

In addition to the detection of passive objects via their 
shape or some kind of barcode, it is also possible to embed 
a very small infrared transmitter into an object. In this way, 
the object can transmit a code representing its identity, its 
state, or some other information, and this data transmission 
can be picked up by the IR detectors built into ThinSight. 
Indeed, ThinSight naturally supports bidirectional IR-based 
data transfer with nearby electronic devices such as smart- 
phones and PDAs. Data can be transmitted from the dis- 
play to a device by modulating the IR light emitted. With a 
large display, it is possible to support several simultaneous 
bidirectional communication channels in a spatially multi- 
plexed fashion. 

Finally, a device which emits a collimated beam of IR light 
may be used as a pointing device, either close to the display 
surface like a stylus, or from some distance. Such a pointing 
device could be used to support gestures for new forms of 
interaction with a single display or with multiple displays. 
Multiple pointing devices could be differentiated by modu- 


| lating the light generated by each. 


3. THE THINSIGHT HARDWARE 


3.1. The sensing electronics 
The prototype ThinSight circuit board depicted in Figure 


| 3 uses Avago HSDL-9100 retro-reflective infrared sensors. 


These devices are especially designed for proximity sensing 
—an IR LED emits infrared light and an IR photodiode gener- 
ates a photocurrent which varies with the amount of incident 
light. Both emitter and detector have a center wavelength of 


| 940nm. 


A 7 x 5 grid of these HSDL-9100 devices on a regular 
10mm pitch is mounted on custom-made 70 x 50mm 
4-layer printed circuit board (PCB). Multiple PCBs can be 
tiled together to support larger sensing areas. The IR detec- 
tors are interfaced directly with digital input/output lines on 
a PIC18LF4520 microcontroller. 

The PIC firmware collects data from one row of detec- 
tors at a time to construct a “frame” of data which is then 
transmitted to the PC over USB via a virtual COM port. To 


| connect multiple PCBs to the same PC, they must be syn- 


chronized to ensure that IR emitted by a row of devices on 
one PCB does not adversely affect scanning on a neighbor- 
ing PCB. In our prototype we achieve this using frame and 
row synchronization signals which are generated by one 
of the PCBs (the designated “master”) and detected by the 
others (“slaves”). 
Note that more information on the hardware can be 
found in the full research publications.” ° 
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Figure 3. Top: the front side of the sensor PCB showing the 7 x5 array 
of IR optosensors. The transistors that enable each detector are 
visible to the right of each optosensor. Bottom: the back of the 
sensor PCB has little more than a PIC microcontroller, a USB interface 
and FETs to drive the rows and columns of IR emitting LEDs. Three 
such PCBs are used in our ThinSight laptop while there are thirty in 
the tabletop prototype. 


USB interface 


IRLED 
column drivers 


PIC micro 


3.2. LCD technology overview 
To understand how the ThinSight hardware is integrated | 
into a display panel, it is useful to understand the construc- 
tion and operation of a typical LCD. An LCD panel is made 
up of a stack of optical components as shown in Figure 4. At 
the front of the panel is a thin layer of liquid crystal material 
which is sandwiched between two polarizers. The polarizers 
are orthogonal to each other, which means that any light | 
which passes through the first will naturally be blocked by 
the second, resulting in dark pixels. However, if a voltage is 
applied across the liquid crystal material at a certain pixel 
location, the polarization of light incident on that pixel is 
twisted through 90° as it passes through the crystal struc- 
ture. As a result it emerges from the crystal with the correct 
polarization to pass through the second polarizer. Typically, 
white light is shone through the panel from behind by a 
backlight and red, green, and blue filters are used to create 
a color display. In order to achieve a low profile construction 
while maintaining uniform lighting across the entire display | 
and keeping cost down, the backlight is often a large “light | 
guide” in the form of a clear acrylic sheet which sits behind 

the entire LCD and which is edge-lit from one or more sides. 

The light source is often a cold cathode fluorescent tube 

or an array of white LEDs. To maximize the efficiency and 

uniformity of the lighting, additional layers of material may 
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| Diffuser and brightness 


Figure 4. Typical LCD edge-lit architecture shown left. The LCD 
comprises a stack of optical elements. A white light source is 
typically located along one or two edges at the back of the panel. 

A white reflector and transparent light guide direct the light toward 
the front of the panel. The films help scatter this light uniformly and 
enhance brightness. However, they also cause excessive attenuation 
of IR light. In ThinSight, shown right, the films are substituted and 
placed behind the light guide to minimize attenuation and also 


| reduce noise caused by LCD flexing upon touch. The sensors and 


emitters are placed at the bottom of the resulting stack, aligned with 
holes cut in the reflector. 


Standard edge-lit LCD Standard edge-lit LCD 


with ThinSight 


LCD ana 


polarizers 


Enhancing Film 
Light guide 


Neutral density filter 
and Radiant Light Film 
Reflector 


Sensor Emitter 


be placed between the light guide and the LCD. Brightness 
enhancing film (BEF) “recycles” visible light at suboptimal 
angles and polarizations and a diffuser smoothes out any 
local nonuniformities in light intensity. 


3.3. Integration with an LCD panel 

We constructed our ThinSight prototypes using a variety of 
desktop and laptop LCD panels, ranging from 17” to 21”. 
Two of these are shown in Figures 5 and 6. Up to 30 PCBs 
were tiled to support sensing across the entire surface. In 
instances where large numbers of PCBs were tiled, a cus- 
tom hub circuit based on an FPGA was designed to collect 
and aggregate the raw data captured from a number of tiled 
sensors and transfer this to the PC using a single USB chan- 
nel. These tiled PCBs are mounted directly behind the light 
guide. To ensure that the cold cathode does not cause any 
stray IR light to emanate from the acrylic light guide, we 
placed a narrow piece of IR-blocking film between it and 
the backlight. We cut small holes in the white reflector 
behind the light guide to coincide with the location of every 
IR emitting and detecting element. 

During our experiments we found that the combination 
of the diffuser and BEF in an LCD panel typically caused 
excessive attenuation of the IR signal. However, removing 
these materials degrades the displayed image significantly: 
without BEF the brightness and contrast of the displayed 
image is reduced unacceptably; without a diffuser the image 
appears to “float” in front of the backlight and at the same 
time the position of the IR emitters and detectors can be 
seen in the form of an array of faint dots across the entire 
display. 

To completely hide the IR emitters and detectors we 
required a material that lets IR pass through it but not vis- 
ible light, so that the optosensors could not be seen but 
would operate normally. The traditional solution would be 


| SA BR PA I, SR I ELF TS TTA 
Figure 5. Our laptop prototype. Top: Three PCBs are tiled together 


and mounted on an acrylic plate, to give a total of 105 sensing 
pixels. Holes are also cut in the white reflector shown on the 
far left. Bottom left: an aperture is cut in the laptop lid to allow 
the PCBs to be mounted behind the LCD. This provides sensing 
across the center of the laptop screen. Bottom right: side views 
of the prototype—note the display has been reversed on its 
hinges in the style of a Tablet PC. 


Figure 6. The ThinSight tabletop hardware as viewed from 

the side and behind. Thirty PCBs (in a 5x6 grid) are tiled with 
columns interconnected with ribbon cable and attached to 

a hub board for aggregating data and inter-tile communication. 
This provides a total of 1050 discrete sensing pixels across 
the entire surface. 


to use what is referred to as a “cold mirror.” Unfortunately | 


these are made using a glass substrate which means they are 
expensive, rigid and fragile and we were unable to source a 
cold mirror large enough to cover the entire tabletop dis- 
play. We experimented with many alternative materials 
including tracing paper, acetate sheets coated in emulsion 
paint, spray-on frosting, thin sheets of white polythene 
and mylar. Most of these are unsuitable either because of 


| a lack of IR transparency or because the optosensors can 
_ be seen through them to some extent. The solution we set- 
| tled on was the use of Radiant Light Film by 3M (part num- 
_ ber CM500), which largely lets IR light pass through while 


reflecting visible light without the disadvantages of a true 
cold mirror. This was combined with the use of a grade “0” 
neutral density filter, a visually opaque but IR transparent 
diffuser, to even out the distribution rear illumination and 
at the same time prevent the “floating” effect. Applying the 
Radiant Light Film carefully is critical since minor imper- 
fections (e.g. wrinkles or bubbles) are highly visible to the 
user—thus we laminated it onto a thin PET carrier. One 
final modification to the LCD construction was to deploy 
these films behind the light guide to further improve the 
optical properties. The resulting LCD layer stack-up is 
depicted in Figure 4 right. 

Most LCD panels are not constructed to resist physical 
pressure, and any distortion which results from touch inter- 
actions typically causes internal IR reflection resulting in 
“flare.” Placing the Radiant Light Film and neutral density 
filter behind the light guide improves this situation, and 
we also reinforced the ThinSight unit using several lengths 
of extruded aluminum section running directly behind 


the LCD. 


4. THINSIGHT IN OPERATION 


4.1. Processing the raw sensor data 


| Each value read from an individual IR detector is defined 


as an integer representing the intensity of incident light. 
These sensor values are streamed to the PC via USB where 
the raw data undergoes several simple processing and fil- 
tering steps in order to generate an IR image that can be 


| used to detect objects near the surface. Once this image is 


generated, established image processing techniques can be 
applied in order to determine coordinates of fingers, recog- 
nize hand gestures, and identify object shapes. 

Variations between optosensors due to manufacturing 
and assembly tolerances result in a range of different values 
across the display even without the presence of objects on 
the display surface. To make the sensor image uniform and 
the presence of additional incident light (reflected from 


_ nearby objects) more apparent, we subtract a “background” 


frame captured when no objects are present, and normalize 
relative to the image generated when the display is covered 
with a sheet of white reflective paper. 

We use standard bicubic interpolation to scale up the 
sensor image bya predefined factor (10 in ourcurrent imple- 
mentation). For the larger tabletop implementation this 
results in a 350 x 300 pixel image. Optionally, a Gaussian 
filter can be applied for further smoothing, resulting in a 
grayscale “depth” image as shown in Figure 7. 


4.2. Seeing through the ThinSight display 
The images we obtain from the prototype are quite rich, par- 
ticularly given the density of the sensor array. Fingers and 
hands within proximity of the screen are clearly identifiable. 
Examples of images captured through the display are shown 
in Figures 1, 7 and 8. 
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Figure 7. The raw ThinSight sensor data shown left and after 
interpolation and smoothing right. Note that the raw image is a very 
low resolution, but contains enough data to generate the relatively 
rich image at right. 


Figure 8. Fingertips can be sensed easily with ThinSight. Left: the 
user places five fingers on the display to manipulate a photo. Right: 

a close-up of the sensor data when fingers are positioned as shown 
at left. The raw sensor data is: (1) scaled-up with interpolation, 

(2) normalized, (3) thresholded to produce a binary image, and finally 
(4) processed using connected components analysis to reveal the 
fingertip locations. 


Fingertips appear as small blobs in the image as they 
approach the surface, increasing in intensity as they get 
closer. This gives rise to the possibility of sensing both touch 
and hover. To date we have only implemented touch/no- 
touch differentiation, using thresholding. However, we can 
reliably and consistently detect touch to within a few milli- 
meters for a variety of skin tones, so we believe that disam- 
biguating hover from touch would be possible. 

In addition to fingers and hands, optical sensing allows 
us to observe other IR reflective objects through the display. 
Figure 1 illustrates how the display can distinguish the shape 
of many reflective objects in front of the surface, including 
an entire hand, mobile phone, remote control, and a reel 
of white tape. We have found in practice that many objects 
reflect IR. 

A logical next step is to attempt to uniquely identify 
objects by placement of visual codes underneath them. Such 
codes have been used effectively in tabletop systems such as 
the Microsoft Surface and various research prototypes'”** to 
support tangible interaction. We have also started prelimi- 
nary experiments with the use of such codes on ThinSight, 
see Figure 9. 

Active electronic identification schemes are also feasible. 
For example, cheap and small dedicated electronic units 
containing an IR emitter can be stuck onto or embedded 


4.3. Communicating through the ThinSight display 

Beyond simple identification, an embedded IR transmit- 
ter also provides a basis for supporting richer bidirectional 
communication with the display. In theory any IR modula- 
tion scheme, such as the widely adopted IrDA standard, 
could be supported by ThinSight. We have implemented 
a DC-balanced modulation scheme which allows retro- 
reflective object sensing to occur at the same time as data 
transmission. This required no additions oralterations to the 
sensor PCB, only changes to the microcontroller firmware. 
To demonstrate our prototype implementation of this, we 
built a small embedded IR transceiver based on a low power 
MSP430 microcontroller, see Figure 10. We encode 3 bits of 
data in the IR transmitted from the ThinSight pixels to con- 
trol an RGB LED fitted to the embedded receiver. When the 


| user touches various soft buttons on the ThinSight display, 


inside objects that need to be identified. These emitters will | 


produce a signal directed to a small subset of the display 
sensors. By emitting modulated IR it is possible to transmit 
a unique identifier to the display. 
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this in turn transmits different 3 bit codes from ThinSight 
pixels to cause different colors on the embedded device to 
be activated. 

It is theoretically possible to transmit and receive dif- 
ferent data simultaneously using different columns on the 


| RAR SRG ERR LEAT SE OPTI, PRS SY ES ES I TG PES IS SEE ORE OT! 
Figure 9. An example 2" diameter visual marker and the resulting 


ThinSight image after processing. 


EAS LT 7 SSIES WEN ST RTS RSE AE RS Oe SS EN ME 
Figure 10. Using ThinSight to communicate with devices using IR. 
Top left: an embedded microcontroller/IR transceiver/RGB LED 
device. Bottom left: touching a soft button on the ThinSight display 
signals the RGB LED on the embedded device to turn red (bottom 
right). Top right: A remote control is used to signal from a distance 


| the display whichin turn sends an IR command to the RGB device to 


turn the LED blue. 


display surface, thereby supporting spatially multiplexed 
bidirectional communications with multiple local devices 


and reception of data from remote gesturing devices. Of | 


course, it is also possible to time multiplex communications | 


between different devices if a suitable addressing scheme is 
used. We have not yet prototyped either of these multiple- 
device communications schemes. 


4.4. Interacting with ThinSight 
As shown earlier in this section, it is straightforward to sense 


and locate multiple fingertips using ThinSight. In order to | 


do this we threshold the processed data to produce a binary 
image. The connected components within this are isolated, 
and the center of mass of each component is calculated 
to generate representative X, Y coordinates of each finger. 


Avery simple homography can then be applied to map these | 


fingertip positions (which are relative to the sensor image) 
to onscreen coordinates. Major and minor axis analysis or 
more detailed shape analysis can be performed to deter- 
mine orientation information. Robust fingertip tracking 
algorithms or optical flow techniques” can be employed to 
add stronger heuristics for recognizing gestures. 

Using these established techniques, fingertips are 
sensed to within a few millimeters, currently at 23 frames/s. 
Both hover and touch can be detected, and could be disam- 
biguated by defining appropriate thresholds. A user there- 
fore need not apply any force to interact with the display. 
However, it is also possible to estimate fingertip pressure by 
calculating the increase in the area and intensity of the fin- 
gertip “blob” once touch has been detected. 

Figure 1 shows two simple applications developed using 


ThinSight. A simple photo application allows multiple | 
images to be translated, rotated, and scaled using estab- | 


lished multifinger manipulation gestures. We use distance 
and angle between touch points to compute scale factor 
and rotation deltas. To demonstrate some of the capabili- 
ties of ThinSight beyond just multitouch, we have built an 
example paint application that allows users to paint directly 
on the surface using both fingertips and real paint brushes. 
The latter works because ThinSight can detect the brushes’ 
white bristles which reflect IR. The paint application also 
supports a more sophisticated scenario where an artist’s 
palette is placed on the display surface. Although this is vis- 
ibly transparent, it has an IR reflective marker on the under- 
side which allows it to be detected by ThinSight, whereupon 
a range of paint colors are rendered underneath it. The user 
can change color by “dipping” either a fingertip or a brush 


described some of its potential; here we discuss a number of 
additional observations and ideas which came to light dur- 
ing the work. 


_ 5.1. Fidelity of sensing 


The original aim of this project was simply to detect fin- 
gertips to enable multi-touch-based direct manipulation. 


_ However, despite the low resolution of the raw sensor data, 


into the appropriate well in the palette. We identify the pres- 


ence of this object using a simple ellipse matching algo- | 


rithm which distinguishes the larger palette from smaller 
touch point “blobs” in the sensor image. Despite the lim- 
ited resolution of ThinSight, it is possible to differentiate a 
number of different objects using simple silhouette shape 
information. 


5. DISCUSSION AND FUTURE WORK 

We believe that the prototype presented in this article is an 
interesting proof-of-concept of a new approach to multi- 
touch and tangible sensing for thin displays. We have already 


we still detect quite sophisticated object images. Very small 
objects do currently “disappear” on occasion when they are 
midway between optosensors. However, we have a num- 
ber of ideas for improving the fidelity further, both to sup- 
port smaller objects and to make object and visual marker 
identification more practical. An obvious solution is to 
increase the density of the optosensors, or at least the den- 
sity of IR detectors. Another idea is to measure the amount 
of reflected light under different lighting conditions—for 
example, simultaneously emitting light from neighboring 
sensors is likely to cause enough reflection to detect smaller 
objects. 


5.2. Frame rate 

In informal trials of ThinSight for a direct manipulation 
task, we found that the current frame rate was reasonably 
acceptable to users. However, a higher frame rate would not 
only produce a more responsive UI which will be important 
for some applications, but would make temporal filtering 
more practical thereby reducing noise and improving sub- 
pixel accuracy. It would also be possible to sample each 
detector under a number of different illumination condi- 
tions as described above, which we believe would increase 
fidelity of operation. 


5.3. Robustness to lighting conditions 

The retro-reflective nature of operation of ThinSight com- 
bined with the use of background substitution seems to give 
reliable operation in a variety of lighting conditions, includ- 
ing an office environment with some ambient sunlight. 


| One common approach to mitigating any negative effects 


of ambient light, which we could explore if necessary, is to 
emit modulated IR and to ignore any nonmodulated offset 
in the detected signal. 


5.4. Power consumption 
The biggest contributor to power consumption in ThinSight 
is emission of IR light; because the signal is attenuated in 
both directions as it passes through the layers of the LCD 
panel, a high intensity emission is required. For mobile 
devices, where power consumption is an issue, we have 
ideas for improvements. We believe it is possible to enhance 
the IR transmission properties of an LCD panel by optimiz- 
ing the materials used in its construction for this purpose— 
something which is not currently done. In addition, it may 
be possible to keep track of object and fingertip positions, 
and limit the most frequent IR emissions to those areas. The 
rest of the display would be scanned less frequently (e.g. at 
2-3 frames/s) to detect new touch points. 

One of the main ways we feel we can improve on power 
consumption and fidelity of sensing is to use a more 
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sophisticated IR illumination scheme. We have been experi- 
menting with using an acrylic overlay on top of the LCD and 
using IR LEDs for edge illumination. This would allow us to 
sense multiple touch points using standard Frustrated Total 
Internal Reflection (FTIR),° but not objects. We have, how- 
ever, also experimented with a material called Endlighten 
which allows this FTIR scheme to be extended to diffuse illu- 
mination, allowing both multitouch and object sensing with 
far fewer IR emitters than our current setup. The overlay can 
also serve the dual purpose of protecting the LCD from flex- 
ing under touch. 


6. RELATED WORK 


The area of interactive surfaces has gained particular | 


attention recently following the advent of the iPhone and 
Microsoft Surface. However, itis a field with over two decades 
of history.’ Despite this sustained interest there has been an 
evident lack of off-the-shelf solutions for detecting multiple 
fingers and/or objects on a display surface. Here, we sum- 
marize the relevant research in these areas and describe the 
few commercially available systems. 


6.1. Camera-based systems 

One approach to detecting multitouch and tangible input 
is to use a video camera placed in front of or above the 
surface, and apply computer vision algorithms for sens- 
ing. Early seminal work includes Krueger’s VideoDesk'* 
and the DigitalDesk,*® which use dwell time and a micro- 
phone (respectively) to detect when a user is actually touch- 
ing the surface. More recently, the Visual Touchpad” and 
C-Slate® use a stereo camera placed above the display to 
more accurately detect touch. The disparity between the 
image pairs determines the height of fingers above the 
surface. PlayAnywhere** introduces a number of addi- 
tional image processing techniques for front-projected 
vision-based systems, including a shadow-based touch 
detection algorithm, a novel visual bar code scheme, 
paper tracking, and an optical flow algorithm for biman- 
ual interaction. 

Camera-based systems such as those described above 
obviously require direct line-of-sight to the objects being 
sensed which in some cases can restrict usage scenarios. 
Occlusion problems are mitigated in PlayAnywhere by 
mounting the camera off-axis. A natural progression is to 
mount the camera behind the display. HoloWall'* uses IR 
illuminant and a camera equipped with an IR pass filter 


behind a diffusive projection panel to detect hands and | 


other IR-reflective objects in front of it. The system can accu- 
rately determine the contact areas by simply thresholding 
the infrared image. TouchLight” uses rear-projection onto a 
holographic screen, which is also illuminated from behind 
with IR light. A number of multitouch application scenarios 
are enabled including high-resolution imaging capabilities. 
Han® describes a straightforward yet powerful technique 


for enabling high-resolution multitouch sensing on rear- | 


projected surfaces based on FTIR. Compelling multitouch 
applications have been demonstrated using this technique. 
The Smart Table” uses this same FTIR technique in a table- 
top form factor. 
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The Microsoft Surface and ReacTable” also use rear- 
projection, IR illuminant and a rear mounted IR camera to 
monitor fingertips, this time in a horizontal tabletop form- 
factor. These systems also detect and identify objects with 
TR-reflective markers on their surface. 

The rich data generated by camera-based systems pro- 
vides extreme flexibility. However, as Wilson discusses** 
this flexibility comes at a cost, including the computa- 
tional demands of processing high resolution images, sus- 
ceptibility to adverse lighting conditions and problems of 
motion blur. However, perhaps more importantly, these 
systems require the camera to be placed at some distance 
from the display to capture the entire scene, limiting their 
portability, practicality and introducing a setup and cali- 
bration cost. 


6.2. Opaque embedded sensing 

Despite the power of camera-based systems, the associ- 
ated drawbacks outlined above have resulted in a number 
of parallel research efforts to develop a non-vision-based 
multitouch display. One approach is to embed a multi- 
touch sensor of some kind behind a surface that can have 
an image projected onto it. A natural technology for this is 
capacitive sensing, where the capacitive coupling to ground 
introduced bya fingertip is detected, typically by monitoring 
the rate of leakage of charge away from conductive plates or 
wires mounted behind the display surface. 

Some manufacturers such as Logitech and Apple have 
enhanced the standard laptop-style touch pad to detect 
certain gestures based on more than one point of touch. 
However, in these systems, using more than two or three fin- 
gers typically results in ambiguities in the sensed data. This 
constrains the gestures they support. Lee et al.‘ used capac- 
itive sensing with a number of discrete metal electrodes 
arranged in a matrix configuration to support multitouch 
over a larger area. Westerman’ describes a sophisticated 
capacitive multitouch system which generates x-ray-like 
images of a hand interacting with an opaque sensing sur- 
face, which could be projected onto. A derivative of this work 
was commercialized by Fingerworks. 

DiamondTouch!’ is composed of a grid of row and col- 
umn antennas which emit signals that capacitively couple 
with users when they touch the surface. Users are also 
capacitively coupled to receivers through pads on their 
chairs. In this way the system can identify which antennas 
behind the display surface are being touched and by which 
user, although a user touching the surface at two points can 
produce ambiguities. The SmartSkin*! system consists of 
a grid of capacitively coupled transmitting and receiving 
antennas. As a finger approaches an intersection point, 
this causes a drop in coupling which is measured to deter- 
mine finger proximity. The system is capable of supporting 
multiple points of contact by the same user and generat- 
ing images of contact regions of the hand. SmartSkin and 
DiamondTouch also support physical objects, but can only 
identify an object when a user touches it. Tactex provide 
another interesting example of an opaque multitouch sen- 
sor, which uses transducers to measure surface pressure at 
multiple touch points.” 


6.3. Transparent overlays 

The systems above share one major disadvantage: they all 
rely on front-projection for display. The displayed image 
will therefore be broken up by the user’s fingers, hands and 
arms, which can degrade the user experience. Also, a large 
throw distance is typically required for projection which 
limits portability. Furthermore, physical objects can only 
be detected in limited ways, if object detection is supported 
at all. 

One alternative approach to address some of the issues 
of display and portability is to use a transparent sensing 
overlay in conjunction with a self-contained (i.e., not pro- 
jected) display such as an LCD panel. DualTouch"’ uses 
an off-the-shelf transparent resistive touch overlay to 
detect the position of two fingers. Such overlays typically 
report the average position when two fingers are touching. 
Assuming that one finger makes contact first and does 
not subsequently move, the position of a second touch 
point can be calculated. An extension to this is provided 
by Loviscach.'® 

The Philips Entertaible’? takes a different “overlay” 


approach to detect up to 30 touch points. IR emitters and | 
| applications. 


detectors are placed on a bezel around the screen. Breaks in 
the IR beams detect fingers and objects. The SMART DViT” 
and HP TouchSmart® utilize cameras in the corners of a 
bezel overlay to support sensing of two fingers or styluses. 
With such line of sight systems, occlusion can be an issue 
for sensing. 

The Lemur music controller from JazzMutant!! uses a 
proprietary resistive overlay technology to track up to 20 
touch points simultaneously. More recently, Balda AG 
and N-Trig’? have both released capacitive multitouch 
overlays, which have been used in the iPhone and the 
Dell XT, respectively. These approaches provide a robust 
way for sensing multiple fingers touching the surface, 
but do not scale to whole hand sensing or tangible 
objects. 


6.4. The need for intrinsically integrated sensing 

The previous sections have presented a number of multi- 
touch display technologies. Camera-based systems produce 
very rich data but have a number of drawbacks. Opaque sens- 
ing systems can more accurately detect fingers and objects, 
but by their nature rely on projection. Transparent overlays 
alleviate this projection requirement, but the fidelity of sens- 
ing is reduced. It is difficult, for example, to support sensing 
of fingertips, hands and objects. 

A potential solution which addresses all of these 
requirements is a class of technologies that we refer to as 
“intrinsically integrated” sensing. The common approach 
behind these is to distribute sensing across the display 
surface, integrating the sensors with the display elements. 
Hudson’ reports on a prototype 0.7" monochrome display 
where LED pixels double up as light sensors. By operating 
one pixel as a sensor while its neighbors are illuminated, 
it is possible to detect light reflected from a fingertip close 
to the display. The main drawbacks are the use of visible 
illuminant during sensing and practicalities of using LED- 
based displays. SensoLED uses a similar approach with 


visible light, but this time based on polymer LEDs and pho- 
todiodes. A 1" diagonal sensing polymer display has been 
demonstrated.’ 

Planar’ and Toshiba™ were among the first to develop 
LCD prototypes with integrated visible light photosensors, 
which can detect the shadows resulting from fingertips or 
styluses on the display. The photosensors and associated 
signal processing circuitry are integrated directly onto the 
LCD substrate. To illuminate fingers and other objects, 
either an external light source is required—impacting on 
the profile of the system—or the screen must uniformly 
emit bright visible light—which in turn will disrupt the dis- 
played image. 

The motivation for ThinSight was to build on the con- 
cept of intrinsically integrated sensing. We have extended 
the work above using invisible (IR) illuminant to allow 
simultaneous display and sensing, building on current 
LCD and IR technologies to make prototyping practical 
in the near term. Another important aspect is support 
for much larger thin touch-sensitive displays than is 
provided by intrinsically integrated solutions to date, 
thereby making it more practical to prototype multitouch 


| 7. CONCLUSION 


In this article we have described a new technique for 
optically sensing multiple objects, including fingertips, 
through thin form-factor displays. Optical sensing allows 
rich “camera-like” data to be captured by the display and 
this is processed using computer vision techniques. This 
supports new types of human computer interfaces that 
exploit zero-force multi-touch and tangible interaction 
on thin form-factor displays such as those described in 
Buxton.? We have shown how this technique can be inte- 
grated with off-the-shelf LCD technology, making such 
interaction techniques more practical and deployable in 
real-world settings. 

We have many ideas for potential refinements to the 
ThinSight hardware, firmware, and PC software. In addi- 
tion to such incremental improvements, we also believe 
that it will be possible to transition to an integrated “sens- 
ing and display” solution which will be much more straight- 
forward and cheaper to manufacture. An obvious approach 


| is to incorporate optical sensors directly onto the LCD 


backplane, and as reported earlier early prototypes in this 
area are beginning to emerge.” Alternatively, polymer pho- 
todiodes may be combined on the same substrate as poly- 
mer OLEDs?’ for a similar result. The big advantage of this 
approach is that an array of sensing elements can be com- 
bined with a display at very little incremental cost by simply 
adding “pixels that sense” in between the visible RGB dis- 
play pixels. This would essentially augment a display with 
optical multitouch input “for free,” enabling truly wide- 
spread adoption of this exciting technology. 
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sented groups are especially encouraged to apply. 

Beyond its research and academic activities, 
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cs.carleton.edu/hiring. 


Carleton is a highly selective liberal arts col- 
lege with outstanding, enthusiastic students. We 
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committed to excellence in teaching, curriculum 
design, ongoing research, and undergraduate re- 
search advising. To apply, send cover letter, CV, 
brief statements describing teaching philosophy 
and research agenda, graduate transcript, and 
three reference letters, at least one of which ad- 
dresses your teaching, to cssearch@carleton.edu. 
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write to: Tenure-Track Search Committee, Com- | 
puter Science Department, Carleton College, One | 
North College Street, Northfield, MN 55057. Ap- | 
plications completed by January 1, 2010 will re- | 
ceive full consideration. 

Carleton College is an affirmative action/ 
equal opportunity employer. We are committed 
to developing our faculty to better reflect the di- 
versity of our student body and American society. 
Women and members of minority groups are 
strongly encouraged to apply. 


Coastal Carolina University 
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of Computer Science 


The Department of Computer Science & Informa- 
tion Systems at Coastal Carolina University invites 
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sor positions beginning fall 2010. While special 
interest will be given to candidates specializing 
in data mining, computational intelligence, bio- 
medical applications, and embedded systems, 
qualified applicants from all areas of Computer 
Science are encouraged to apply. Preference 
will be given to candidates who can work across 
multiple disciplines and collaborate with other 
faculty, developing successful inter-disciplinary 
research proposals. 

The Department’s 11 faculty support an ABET 
accredited B.S. in Computer Science, and a B.S. 
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have a Ph.D. in Computer Science. Faculty are ex- 
pected to commit to quality teaching, scholarship 
and service. 

The University is a public mid-sized, compre- 
hensive liberal arts-oriented institution. The Uni- 
versity is located in Conway, South Carolina, just 
nine miles from the Atlantic coastal resort of Myr- 
tle Beach, one of the fastest-growing metropoli- 
tan areas in the nation. It has an enrollment of 
8,200 students and is expected to have continued 
growth for the next several years. The University 
is a part of the South Carolina system of public 
education and has close ties with its founders, the 
Horry County Higher Education Commission. 

Interested candidates should apply online at: 
http://jobs.coastal/edu. Candidates are asked to 
submit: a letter of application, curriculum vitae, 
three letters of reference, and two conference 
proceedings or journal articles, along with a 4-5 
paragraph statement of teaching philosophy. Re- 
view of applications will begin immediately and 
continue until the positions are filled. 

Applicants must have a Ph.D. in Computer 
Science. Faculty are expected to commit to qual- 
ity teaching, scholarship and service. 

While special interest will be given to candi- 
dates specializing in data mining, computational 
intelligence, biomedical applications, and em- 
bedded systems, qualified applicants from all ar- 
eas of Computer Science are encouraged to apply. 
Preference will be given to candidates who can 
work across multiple disciplines and collaborate 
with other faculty, developing successful inter- 
disciplinary research proposals. 

Coastal Carolina University 
is an EO/AA employer. 


College of William & Mary 
Faculty Position — Assistant Professor 


We invite applications for a tenure-track assistant 
professor position in Computer Science for Fall 
2010. We are interested in individuals with re- 
search expertise in all areas of computer science. 
Applicants must hold a Ph.D. in computer science 
orarelated field at the time of appointment, must 
have a strong research record, and should have an 
interest in teaching. 

The College of William and Mary, consistently 
ranked in the elite group of the Best National Uni- 
versities Doctoral by U.S. News and World Report, 
has committed to a multi-year effort to strength- 
en its computer science research program. The 
department currently consists of fourteen faculty 
members who support B.S., M.S., and Ph.D. pro- 
grams. More information about the department 
and university can be obtained at: 

http://www.cs.wm.edu. 


Applicants should submit a current resume, 
research and teaching statements, the names of 
at least three references, and supporting docu- 
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ments they consider most relevant. We prefer that 
the application materials be submitted as PDF at- 
tachments in email to 09search@CS.WM.EDU. If 
necessary, hard copy may be submitted to: 

Faculty Search Committee 

Department of Computer Science 

The College of William & Mary 

P.O. Box 8795 

Williamsburg, VA 23187-8795 


Review of applications will begin December 
15 and continue until the position is filled. The 
College is an EEO/AA employer. 


Duke University 
Department of Computer Science 


The Department of Computer Science at Duke 
University invites applications and nominations 
for tenure-track faculty positions at an assistant 
professor level, to begin August 2010. We are in- 
terested in strong candidates in all active research 
areas of computer science, both core and interdis- 
ciplinary areas, including algorithms, artificial in- 
telligence, computational economics, computer 
architecture, computer vision, database systems, 
distributed systems, machine learning, network- 
ing, security, and theory. 

The department is committed to increasing 
the diversity of its faculty, and we strongly en- 
courage applications from women and minority 
candidates. 

A successful candidate must have a solid 


disciplinary foundation and demonstrate prom- 
ise of outstanding scholarship in every respect, 
including research and teaching. Please refer to 
www.cs.duke.edu for information about the de- 
partment and to www.provost.duke.edu/faculty/ 
for information about the advantages that Duke 
offers to faculty. 

Applications should be submitted online at 
www.cs.duke.edu/facsearch. A Ph.D. in computer 
science or related area is required. To guarantee | 
full consideration, applications and letters of ref- 
erence should be received by January 3, 2010. 

Durham, Chapel Hill, and the Research Tri- 
angle of North Carolina are vibrant, diverse, and 
thriving communities, frequently ranked among 
the best places in the country to live and work. 
Duke and the many other universities in the area 
offer a wealth of education and employment op- 
portunities for spouses and families. 

Duke University is an affirmative action, equal 
opportunity employer. 


Duke University 
Tenure-track Faculty 


The Department of Electrical and Computer En- 
gineering at Duke University invites applications 
for tenure-track faculty positions at all levels. We 
are interested in strong candidates in all areas of | 
computer engineering. 

Applications should be submitted online at 
www.ee.duke.edu/employment. 

Applications and letters of reference should 


be received by December 31, 2009. 
Duke University is an affirmative action, equal 
opportunity employer. 


Earlham College 
Visiting Assistant Professor of Computer 
Science 


One-year, full-time, non-tenure-track beginning 
Fall 2010. Review of applications begins January 
15, 2010. CS at Earlham is a small, dynamic, inno- 
vative, student-centered department. See http:// 
cs.earlham.edu/ad.html 


The George Washington University 
Department of Computer Science 
Three Faculty Positions 


The Department of Computer Science at The 
George Washington University is seeking appli- 
cants for three faculty positions. The first is (A) a 
tenured senior position in security, at the rank of 
Associate or Full Professor. The second position 
is (B) a tenure-track position in the area systems 
security, and the third is (C) a tenure-track posi- 
tion in the area of Artificial Intelligence with a 
focus on Robotics. Position B and C will be at the 
rank of Assistant or Associate Professor. The rank 
for all positions will depend on experience. Suc- 
cessful candidates may start as early as Fall 2010. 

Basic Qualifications: All applicants must have 
a Ph.D degree in Computer Science or a closely 


The University of New Mexico 


Great people doing great things 
Computer Science Assistant Professor Position 


The Department of Computer Science invites applications for one probationary 
appointment leading to a tenure decision at the Assistant Professor level. We seek 
applicants from across computer science, including but not limited to large data 
systems, networks, software design and engineering, graphics and visualization, and 
AI and adaptive systems. We are a strongly interdisciplinary department and are 
particularly interested in applicants pushing the boundaries of computer science 


with other fields. 


For more information refer to full job ad at http://www.cs.unm.edu/jobs/ 


Candidates must have completed a doctorate in CS or a relevant area by August 15, 
2010. Applicants should demonstrate a strong commitment to undergraduate and 
graduate education, have a research profile in one of the general CS areas 
enumerated above, and demonstrate the ability to establish a nationally visible 


research program. 


For best consideration, complete applications must be received by January 11, 2010. 
The position will remain open until filled. Each application must include a cover 
letter summarizing the applicant’s experience, curriculum vita, research statement, 


teaching statement, and three letters of reference. 


It is the applicant’s 


responsibility to ensure that the letters of reference are submitted before the 


application deadline. 


Applications must be submitted online through https://UNMJobs.unm.edu, by 
referencing posting #0803446. Reference letters should be emailed directly to 


faculty_search@cs.unm.edu 


Inquiries should be sent to: faculty_search@cs.unm.edu 


The University of New Mexico is an Equal Opportunity/Affirmative Action Employer and Educator. 
For additional information about UNM see http://www.unm.edu 
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INFORMATION SCIENCE 
Cornell University (Ref: ISF#3) 


Cornell's Information Science Program 
(www.infosci.cornell.edu) is seeking to fill one 
or more tenure-track faculty positions. 


Information Science at Cornell brings together 
those interested in studying information 
systems in their social, cultural, economic, 
historical, legal, and political contexts. We are 
especially interested in the areas of: 
¢ large information networks and communities 
with a focus on methods for their design, 
analysis, or visualization; 
analysis of socio-technical systems with a 
focus on their governance or the practices 
that emerge within them. 


We will also consider candidates in other areas 
of Information Science. The ideal applicant will 
have a strong mix of both technical and social 
science research skills and accomplishments. 
Please see www.infosci.cornell.edu/jobs for a 
full description of the positions. 

To ensure full consideration, applications should 
be received by January 15, 2010, but will be 
accepted until all positions are filled. Applicants 
should submit a curriculum vita, brief statements 
of research and teaching interests through the 
web at http://www.infosci.cornell.edu/apply, 
and arrange to have at least three letters of 
reference uploaded on the Web. 


Cornell University 


Cornell University is an Affirmative Action 
Equal Opportunity Employer and Educator. 


related field. (A) Applicants to the tenured senior 
position in security must currently have well- 
funded research programs, must be a recognized 
scholar in the research community, and must be 
prepared to take on a leading research role within 
the department and in the field. (B and C) ABD 
candidates may apply for the Assistant Professor 
rank, but they must complete their Ph.D degree 
by August 15, 2010. Applicants for the Assistant 
Professor rank must demonstrate a potential for 


developing an advanced research program and | 


for attracting significant research funding. Appli- 
cants for the Associate Professor rank must have 
well-established and well-funded research pro- 
grams. All applicants must have demonstrated 
teaching excellence or potential at both under- 
graduate and graduate levels. 

The George Washington University is the larg- 
est academic institution in the nation’s capital 
with close access to many Federal funding agen- 
cies and research laboratories. The University 
offers comprehensive programs of undergradu- 
ate and graduate liberal arts studies as well as 
degrees in engineering, law, medicine, public 
health, education, business and international 
affairs. A private institution, GW prides itself on 
excellent research, quality education, and low 
student-teacher ratio. The exceptional location 
affords the GW community unique cultural and 
intellectual opportunities. In the high-tech sec- 
tor, the Washington, DC Metropolitan area is one 
of the largest IT areas in the nation, putting us in 
the center of activities such as security and bio- 
technology. 

The Department of Computer Science of- 
fers an accredited Bachelor of Science program, 
a Bachelor of Arts program, and Master’s and 


Ph.D degrees. The Department has 18 faculty | 
members, numerous affiliated and adjunct fac- | 


ulty members, and over 425 students. The Depart- 
ment has educational and research programs in 


security, networks, graphics, biomedical appli- | 


Indiana University Southeast 


| Assistant Professor of Informatics 


cations, search and data mining, human com- | 


puter interaction, and machine intelligence, with 
funding from various agencies; a NSA-designated 
Center of Academic Excellence and Center of Aca- 
demic Excellence-Research in security, with fund- 
ing from NSF, DOD, and other agencies; and NIH- 
funded collaborations with the medical school 
in the biomedical areas. For further information 
please refer to http://www.cs.gwu.edu. 

Review of applications will begin on Decem- 
ber 1, 2009, and will continue through the Spring 
2010 semester, until the position is filled. Appli- 
cation Procedure: To be considered, applicants 
must send a letter containing (i) a brief statement 
that clearly indicates the position and rank of in- 
terest, (ii) a curriculum vita, (iii) a statement of re- 
search and teaching interests, and (vi) complete 
contact information for at least three references. 
These and other relevant supporting materials 
should be sent to: Chair, Faculty Search Commit- 
tee, Department of Computer Science, PHIL 703, 
The George Washington University, Washington 
D.C. 20052. Electronic submissions are preferred, 
and can be sent to cssearch@gwu.edu. Only com- 
plete applications will be considered. Inquiries 
about applying will be accorded the utmost dis- 
cretion. For complete instructions on the applica- 
tion process, please visit the department website 
www.cs.gwu.edu. 

The George Washington University is an equal 
opportunity/affirmative action employer. 


Tenure-track Informatics position for Fall 2010. 
Applications accepted until January 15, 2010 or 
until position is filled. For further position de- 
scription and how to apply go to www.ius.edu/hr/ 
employment_opportunities.cfm 


IST Austria is looking for 
Professors and Assistant Professors 


IST Austria (Institute of Science and Technology 
Austria) is a new Institute located near Vienna, 
dedicated to basic research at the highest inter- 
national level. The Institute invites applications 


and nominations for Professors and Assistant 


Professors in Life Sciences, Physical Sciences, 
Mathematics and Computer Science, as well as in 
any multidisciplinary field. 

The Institute (www.ist.ac.at), established by 
the Austrian Government, opened its campus in 
2009. Its funding is substantial, allowing for over 
500 employees and graduate students by 2016. 
IST Austria is entitled to award Ph.D. degrees and 
includes an English-language Graduate School. 
It aims to achieve an international mix of scien- 
tists and recruit them solely on the basis of their 
individual excellence and potential contribution 
to research. 

The Institute is recruiting leaders of inde- 
pendent research groups. Professors will have 
indefinite contracts and Assistant Professors will 
have fixed-term contracts for an initial period of 


five years, with a possible, but not automatic, re- 
newal for two additional years. Before the end of 
this period, the scientist will be considered for an 
indefinite appointment as a Professor at IST Aus- 
tria, the decision being based on merit only (as is 
the case for a “Tenure-Track Assistant Professor” 
at U.S. universities). 

The selected candidates will receive a com- 
petitive salary and a substantial annual research 
budget, covering operating expenses and the 
cost of Ph.D. students, postdoctoral fellows, and 
technical staff. Additional costs of starting a new 
laboratory, including instruments and infrastruc- 
ture, will be offered separately. Scientists are also 
expected to apply for external research grants. 

Applications and nominations should be sent 
to professor@ist.ac.at or assistant.professor@ist. 
ac.at, depending on the relevant position. Appli- 
cations must include a CV, a list of publications 
anda research plan. Nominations should include 
an appraisal of the achievements and scientific 
qualifications of the nominee. 

IST Austria is committed to Equality and Di- 
versity. In particular female applicants are en- 
couraged to apply. 


The Johns Hopkins University 
Tenure-track Faculty Positions 


The Department of Computer Science at The Johns 
Hopkins University is seeking applications for 
tenure-track faculty positions at the Assistant Pro- 
fessor level, as well as applications for a focused 
search at the Associate or Full Professor level. 


ate 


ECOLE POLYTECHNIQUE 
FEDERALE DE LAUSANNE 


The School of Computer and Communi- 
cation Sciences at EPFL invites applica- 
tions for faculty positions in computer 
science. We are primarily seeking candi- 
dates for tenure-track assistant pro- 
fessor positions, but suitably qualified 
candidates for senior positions will also 
be considered. 


Successful candidates will develop an 
independent and creative research pro- 
gram, participate in both undergraduate 
and graduate teaching, and supervise PhD 
students. 


Candidates from all areas of computer 
science will be considered, but preference 
will be given to candidates with interests 
in algorithms, bio-informatics, machine 
learning and verification. 


EPFL offers internationally competitive 
salaries, significant start-up resources, 
and outstanding research infrastructure. 


Ecole polytechnique fédérale de Lausanne 


Faculty Positions 
in Computer Science 


To apply, please follow the application 
procedure at http://icrecruiting.epfl.ch. 
The following documents are requested in 
PDF format: curriculum vitae, including 
publication list, brief statements of re- 
search and teaching interests, names and 
addresses (including e-mail) of 3 refe- 
rences for junior positions, and 6 for 
senior positions. Screening will start on 
January 15, 2010. Further questions can 
be addressed to : 


Professor Willy Zwaenepoel 
Dean 

School of Computer and 
Communication Sciences EPFL 
CH-1015 Lausanne, Switzerland 
recruiting.ic@epfl.ch 


For additional information on EPFL, please 
consult: http://www.epfl.ch or 
http://ic.epfl.ch 


EPFL is an equal opportunity employer. 
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At the Assistant Professor level all areas will be 
considered, but candidates with research agen- 
das in computational genomics, computer sys- 
tems, and health-related applications will receive 
special attention. At the Associate or Full Profes- | 
sor level we are seeking outstanding applicants in 
the security area. All applicants must havea Ph.D. | 
in computer science or a related field and are ex- 
pected to show evidence of an ability to establish 
a strong, independent, multidisciplinary, inter- 
nationally recognized research program. 

Commitment to quality teaching at the un- 
dergraduate and graduate levels will be required 
of all candidates considered. The department | 
webpage at http://www.cs.jhu.edu provides infor- 
mation about the department, including links to 
research laboratories and centers. 

Applicants should apply using the online ap- | 
plication which can be accessed from https:// 
academicjobsonline.org/ajo/JHU/CS/201 (for full 
consideration, by January 4 2010). Questions 
should be directed to fsearch@cs.jhu.edu. The | 
Department is committed to building a diverse 
educational environment; women and minorities 
are strongly encouraged to apply. The Johns Hop- 
kins University is an EEO/AA employer. 

Faculty Search 

Johns Hopkins University 

Department of Computer Science 

Room 224 New Engineering Building 

Baltimore, MD 21218-2694 

Fax: 410-516-6134 

Phone: 410-516-8775 

fsearch@cs.jhu.edu 

http://www.cs.jhu.edu/apply 


Lawrence Technological University 
Assistant Professor of Computer Science 


For appointment in August 2010. The ideal can- 
didate will have a Ph.D. degree in computer sci- 
ence, be primarily committed to the development 
of undergraduate and professional graduate 
computer science students through teaching, ap- 
plied projects and scholarship, be able to work 
effectively in interdisciplinary teams, and believe 
strongly in the value of both theory and applica- 
tion. Applicants should email a cover letter, cur- 
riculum vitae, statement of teaching philosophy 
and research interests, and three letters of recom- 
mendation. 


Lehigh University 

Department of Computer Science and 
Engineering 

Faculty Opening 


Applications are invited for a tenure-track Assis- 
tant Professor position in Bioinformatics in the 
Computer Science and Engineering Department 
(http://www.cse.lehigh.edu) of Lehigh University. 
The position starts in August 2010. 

The potential to establish a successful re- 
search program and teach effectively at both the 
undergraduate and graduate level are essential 
prerequisites. The successful applicant will holda 
Ph.D. in Computer Science, Computer Engineer- 
ing, Bioinformatics, ora closely related field. Out- 
standing candidates in areas of computer science 
with a direct connection to bioinformatics will be 


The University of New Mexico 
Great people doing great things 


Dean, School of Engineering 


The University of New Mexico is seeking a visionary leader for the position of Dean of the School 
of Engineering (SOE). Located in Albuquerque, UNM is a Carnegie Very High Research University 
and a federally-designated Hispanic-Serving Institution. Founded in 1889, UNM is the largest and 
most comprehensive of the state’s institutions of higher education. The Dean, who reports directly 
to the Provost/Executive Vice President for Academic Affairs, provides academic and administra- 
tive leadership to the SOE. As the primary academic and administrative officer of the faculty of the 
School of Engineering, the Dean will assume a central leadership role in continuing development 
of the SOE’s disciplines toward national eminence with a commitment to research and teaching. 
The Dean is responsible for improving and promoting the SOE in areas of instruction, research, fis- 
cal management, development and personnel. The School of Engineering (SOE) has over 100 fac- 
ulty in five academic departments: Chemical and Nuclear Engineering, Civil Engineering, 
Computer Science, Electrical and Computer Engineering, and Mechanical Engineering. Among the 
internationally recognized centers associated with the school are the Center for High Technology 
Materials, the Center for Microengineered Materials, the Center for Advanced Research 
Computing, the Mind Research Network, the Center for Biomedical Engineering, and the Center 
for Emerging Energy Technologies. UNM is also a member institution of the WERC-A Consortium 
for Environmental Education & Technology Development. The SOE is ranked nationally for both 
its undergraduate and graduate programs. 


Minimum qualifications for the position include an earned Ph.D. in Engineering or Computer 
Science or doctoral degree in a related discipline, a record of accomplishment or credentials that 
would merit appointment at the rank of tenured full professor in an academic department of UNM’s 
School of Engineering, and three years of management and administrative experience at the level 
of department head, chair, research director, center director, dean, or the equivalent. 


For a complete listing of the position description including desired qualifications and application 
procedures, access the job posting on UNM’s online application system, UNMJobs.unm.edu, by 
referencing posting number 0803339. For best consideration, applications should be submitted by 
January 11, 2010, and the position is open until filled. Request a copy of the position announcement 
from Bonnie Leigh Reifsteck, Search Coordinator, bonniec@unm.edu (phone: 505.277.2611). 


The University of New Mexico is an Equal Opportunity/Affirmative Action Employer and Educator. 
For additional information about UNM see http://www.unm.edu 
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considered. Applicants should have an interest in 
teaching core courses in computer science as well 
as courses in their research area. The successful 
applicant will be expected to contribute directly 
to Lehigh’s bioengineering program. 

The faculty of the Computer Science and En- 
gineering (CSE) department includes IEEE and 
ACM fellows, and four NSF CAREER award win- 
ners. We offer B.A., B.S., M.S., and Ph.D. degrees 
in Computer Science and jointly oversee B.S., 
M.S., and Ph.D. degree programs in Computer 
Engineering with the department of Electrical 
and Computer Engineering. We also offer a B.S. 
in Computer Science and Business with the Col- 
lege of Business and Economics. Lehigh offers 
a degree in Bioengineering and has a variety of 
research programs related to bioinformatics and 
medical informatics, including biomedical image 
processing, biopharmaceuticals, algorithms for 
mining large genomic databases, and assistive 
robotics. Lehigh is developing a biotech cluster 
that includes a vibrant undergraduate program, 
a new graduate program in bioengineering, both 
parts of a university-level initiative in healthcare 
research. The bioengineering program draws as- 
sociated faculty from nearly all engineering and 
several science departments and will provide 
opportunities for the successful applicant to de- 
velop collaborative research projects. 

Lehigh is a private institution that is consis- 
tently ranked among the top 40 national research 
universities by U.S. News & World Report and is 
rated “most selective” by both Barron’s and Pe- 
terson’s guides. Located in Bethlehem, Pennsyl- 
vania, Lehigh is 80 miles west of New York City 
and 50 miles north of Philadelphia, providing an 
accessible and convenient location that offers an 
appealing mix of urban and rural lifestyles. 

Applications can be submitted online at 
http://www.cse.lehigh.edu/faculty-search, and 
should include a cover letter, vita, and both teach- 
ing and research statements. In addition, please 
provide the names and email addresses of at least 
three references. Applications will be evaluated 
until the position is filled, but materials should 
be received by January 15, 2010 for full consider- 
ation. Lehigh University is an Equal Opportunity/ 
Affirmative Action Employer. Lehigh University 
provides comprehensive benefits including do- 
mestic partner benefits (see http://www.lehigh. 
edu/~insloan). Questions concerning this search 
may be sent to faculty-search@cse.lehigh.edu. 


Massachusetts Institute of Technology 
Faculty Positions 


The Department of Electrical Engineering and Com- 
puter Science (EECS) seeks candidates for faculty 
positions starting in September 2010. Appointment 
would be at the assistant or untenured associate 
professor level. In special cases, a senior faculty ap- 
pointment may be possible. Faculty duties include 
teaching at the graduate and undergraduate levels, 
research, and supervision of student research. We 
will consider candidates with backgrounds and 
interests in any area of electrical engineering and 
computer science. Faculty appointments will com- 
mence after completion of a doctoral degree. 
Candidates must register with the EECS 
search website at https://eecs-search.eecs.mit. 
edu, and must submit application materials elec- 
tronically to this website. Candidate applications 


should include a description of professional in- 
terests and goals in both teaching and research. 
Each application should include a curriculum 
vita and the names and addresses of three or 
more individuals who will provide letters of rec- 
ommendation. Letter writers should submit their 
letters directly to MIT, preferably on the website 
or by mailing to the address below. Please submit 
complete application by December 15, 2009. 

Send all materials not submitted on the web- 
site to: 

Professor W. Eric L. Grimson 

Department Head, Electrical Engineering 

and Computer Science 

Massachusetts Institute of Technology 

77 Massachusetts Avenue, Room 38-401 

Cambridge, MA 02139 


M.I.T. is an equal 
opportunity/affirmative action employer. 


Max Planck Institute for Software 
Systems (MPI-SWS) 


Tenure-track openings 


Applications are invited for tenure-track and 
tenured faculty positions in all areas related to 
the design, analysis and engineering of software 


systems, including programming languages, for- | 
| Assistant Professor 


mal methods, security, distributed, networked 
and embedded systems, databases and informa- 
tion systems, and human-computer interaction. 
A doctoral degree in computer science or related 
areas and an outstanding research record are 
required. Successful candidates are expected to 
build a team and pursue a highly visible research 
agenda, both independently and in collaboration 
with other groups. Senior candidates must have 
demonstrated leadership abilities and recog- 
nized international stature. 

MPI-SWS, founded in 2005, is part of a net- 


work of eighty Max Planck Institutes, Germany's | 
premier basic research facilities. MPIs have an | 


established record of world-class, foundational 
research in the fields of medicine, biology, chem- 
istry, physics, technology and humanities. Since 
1948, MPI researchers have won 17 Nobel prizes. 


MPI-SWS aspires to meet the highest standards of | 


excellence and international recognition with its 

research in software systems. 

To this end, the institute offers a unique envi- 
ronment that combines the best aspects of a uni- 
versity department and a research laboratory: 

a) Faculty receive generous base funding to build 
and lead a team of graduate students and post- 
docs. They have full academic freedom and 
publish their research results freely. 

b) Faculty have the opportunity to supervise doc- 
toral theses, teach graduate and undergradu- 
ate courses, and have the flexibility to incorpo- 
rate teaching into their research agenda. 

c) Faculty are provided with outstanding techni- 
cal and administrative support facilities as 
well as internationally competitive compensa- 
tion packages. 


Funds have been committed to grow the insti- 
tute to a strength of 17 tenured and tenure-track fac- 
ulty, and about 100 doctoral and post-doctoral posi- 
tions. Additional growth through outside funding is 
expected. We maintain an open, international and 
diverse work environment and seek applications 


from outstanding researchers regardless of nation- | appointment starting in the Fall semester 2010. 


al origin or citizenship. The working language is 
English; knowledge of the German language is not 
required for a successful career at the institute. 

The institute is located in Kaiserslautern and 
Saarbruecken, in the tri-border area of Germany, 
France and Luxembourg. The area offers a high 
standard of living, beautiful surroundings and 
easy access to major metropolitan areas in the 
center of Europe, as well as a stimulating, com- 
petitive and collaborative work environment. In 
immediate proximity are the MPI for Informatics, 
Saarland University, the Technical University of 


Kaiserslautern, the German Center for Artificial | 
Intelligence (DFKI), and the Fraunhofer Insti- | 
_ existing expertise in the department; we are also 


tutes for Experimental Software Engineering and 
for Industrial Mathematics. 

Qualified candidates should apply online at 
http://www.mpi-sws.org/application. The review 
of applications will begin on January 4, 2010, and 
applicants are strongly encouraged to apply by 
that date; however, applications will continue to 
be accepted through January 2010. 

The institute is committed to increasing the rep- 
resentation of minorities, women and individuals 
with physical disabilities in Computer Science. We 
particularly encourage such individuals to apply. 


New Mexico State University 


The Computer Science Department at New Mexico 


| State University invites applications for a tenure- 


track position at the assistant professor level, with 


We are particularly interested in candidates with 
expertise in database management systems, data 
mining or related areas. Applications from wom- 
en and members of traditionally under-represent- 
ed groups are particularly encouraged. Salary and 
start-up package will be competitive and com- 
mensurate with qualifications and experience. 
The minimum qualifications are a Ph.D. de- 
gree in Computer Science or in a closely-related 
discipline by the time of appointment, along with 
demonstrated evidence of excellence in teaching 
and research. We particularly solicit applications 
from candidates with attitude for collaborative re- 
search, whose research foci can complement the 


interested in individuals that can promote inter- 
disciplinary research initiatives. The successful 
candidate will be expected to develop an indepen- 
dent research program and teach graduate and 
undergraduate courses in Computer Science. 


New York University/Courant Institute 
of Mathematical Sciences 
Department of Computer Science 


The department expects to have several regular 
faculty positions beginning in September 2010 
and invites candidates at all levels. We will consid- 
er outstanding candidates in any area of computer 
science with systems being high-priority area. 
Faculty members are expected to be outstand- 
ing scholars and to participate in teaching at all lev- 
els from undergraduate to doctoral. New appoin- 


ANUS 


National University 
of Singapore 


TEMASEK RESEARCH FELLOWSHIP 


The Temasek Research Fellowship (TRF) is a prestigious 
scheme aimed at recruiting outstanding young researchers 
at the post-doctoral level to undertake research as Principal 
Investigators and lead teams to undertake defence-related 
research in the National University of Singapore (NUS). 


The TRF is a 3-year Fellowship with an option to extend 
up to 3 years. The Temasek Research Fellow (TRF-RF) 


may be offered a faculty appointment with NUS at the end 
of their term. 


In addition to an attractive remuneration package that 
will commensurate with qualification and experience, the 
TRF-RF will be provided a research grant to pursue his/ 
her research at Temasek Laboratories at NUS or other 
research entities at NUS. 


For more information and application procedure, please 
visit NUS at 


http://www.nus.edu.sg/dpr/funding/fellowship.htm 
Closing date: 22 December 2009 


Short-listed candidates will be invited for an interview and 
scientific presentation expected to be held in March 2010 
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tees will be offered competitive salaries and startup 
packages, with affordable housing within a short 
walking distance of the department. New York Uni- 
versity is located in Greenwich Village, one of the 
most attractive residential areas of Manhattan. 

The department has 33 regular faculty mem- 
bers and several clinical, research, adjunct, and 
visiting faculty members. The department’s cur- 
rent research interests include algorithms, cryp- 
tography and theory; computational biology; dis- 
tributed computing and networking; graphics, 
vision and multimedia; machine learning; natu- 
ral language processing; scientific computing; 
and verification and programming languages. 

Collaborative research with industry is facili- 
tated by geographic proximity to computer sci- 
ence activities at AT&T, Google, IBM, Bell Labs, 
NEC, and Siemens. 

Please apply at 

https://webern.cs.nyu.edu/faculty_ 
applications/ 


To guarantee full consideration, applica- 
tions should be submitted no later than January 
4, 2010; however, this is not a hard deadline, as 
all candidates will be considered to the full extent 
feasible, until all positions are filled. Visiting po- 
sitions may also be available. 

New York University is an equal opportunity/ 
affirmative action employer. 


Northwestern University 
Tenure-Track Position in Technology and 
Social Behavior 


The Department of Communication Studies in 
the Northwestern University School of Commu- 
nication seeks to hire for a tenure-track appoint- 
ment beginning September 1, 2010. Rank will be 
at the level of assistant professor. 

We are looking for a candidate who can con- 
tribute to a strong interdisciplinary program in 
technology and social behavior. Possible areas of 
expertise include but are not limited to: human- 
computer interaction, computer-mediated com- 
munication and information infrastructures, lan- 
guage and behavior in virtual communities, and 
media and social networks. 

The Department of Communication Studies 
supports a popular undergraduate major and 
graduate programs in Media, Technology, and | 
Society, Interaction and Social Influence, and 
Rhetoric and Public Culture, as well as the inter- 
disciplinary Ph.D. program in Technology and So- | 
cial Behavior. Scholarship includes leading work 
on new media, social networks, and the cultural | 
determination of the public sphere. Through spe- 
cial resources for research support and scholarly 
event programming, the department is able to of- 
fer rich opportunities for scholarly development. 

For full consideration, please send a letter of | 
application with a curriculum vitae, evidence of 
teaching effectiveness, and three letters of refer- 
ence to Professor Daniel O’Keefe at the address 
below by November 15. 

Northwestern University is an equal opportu- 
nity, affirmative action educator and employer. 
Women and minorities are encouraged to apply. 
Hiring is contingent on eligibility to work in the 
United States. 

Technology and Social Behavior Search Chair: 
Daniel O’Keefe 
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Search No. 00050140 

Department of Communication Studies 
2240 Campus Drive 

Evanston, I] 60208 
tsb-search@northwestern.edu 


Old Dominion University 
Richard T. Cheng Endowed 
Chair in Computer Science 


Applications are invited for the Richard T. Cheng 
Chair at the tenured rank of full professor. The 
Computer Science Department offers BS, MS and 
PhD degrees and features research programs in 
the areas of bioinformatics, computational sci- 
ence, digital libraries, computer and wireless 
networking, security, and computer simulation. 
Old Dominion University has major collabora- 
tive opportunities with the Virginia Modeling and 
Simulations Center, NASA Langley, the Thomas 
Jefferson National Laboratory, the Eastern Vir- 
ginia Medical School, and the DoD Joint Training 
Analysis and Simulation Center. 

We are seeking an internationally recognized 
scholar with a significant ongoing externally 
funded research program to fill this endowed po- 
sition. The Cheng Chair will be expected to play 
a leadership role in departmental research and 
to mentor junior faculty, post doctoral fellows 
and graduate students. The department is par- 
ticularly interested in attracting scholars who will 
enable interdisciplinary collaboration. Minimum 
requirements include a PhD in Computer Science 
ora closely related field. 

Applicants should submit a letter of interest 
and curriculum vitae that includes contact infor- 
mation for four references. Applications, nomina- 
tions or inquiries should be sent to: chengsearch@ 
cs.odu.edu or to: Chair, Cheng Chair Committee, 
Computer Science Department, 3300 E & CS Bldg, 
Old Dominion University, Norfolk, VA 23529-0223. 

Screening of applicants will begin immedi- 
ately and continue until the position is closed. 
Old Dominion University is an affirmative action, 
equal opportunity institution and requires com- 
pliance with the Immigration Reform and Con- 
trol Act of 1986. 


Old Dominion University 
Computer Science Faculty Positions 


The College of Sciences at Old Dominion Univer- 
sity is undergoing a major expansion to increase 
research activities of faculty and to support the 
recruiting of outstanding graduate and under- 
graduate students. The Department of Computer 
Science has recently hired a new chair and seeks 
candidates for multiple tenure track/tenured po- 
sitions at senior and junior levels. 

Candidates for full or associate professor 
must show evidence of strong, active research 
programs as evidenced by publications and peer- 
reviewed funding and demonstrate a strong teach- 
ing record. Candidates at the assistant professor 
level must demonstrate the ability to build strong 
research programs and perform quality teaching. 
The department offers competitive salaries and 
substantial start-up packages. The Department 
of Computer Science is well supported by peer- 
reviewed grants from NSF, NASA, DOE and other 
funding agencies. Department is particularly in- 
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terested in attracting researchers who will enable 
interdisciplinary collaboration. Areas of interest 
are broad and include bioinformatics, computa- 
tional biology, parallel computing, modeling and 
simulation, computational science, databases/ 
data mining, but all strong applications will be 
considered. 

Interested candidates should submit a curric- 
ulum vitae, a statement of research activities and 
| future research plans, contact information for 
four references and a statement of teaching phi- 
losophy. Applicants should specify the position 
and level for which they are applying. Electronic 
applications are preferred and should be sent to 
searchcommittee@cs.odu.edu. Paper applica- 
tions can be sent to: 

Search Committee, Department of Computer 
Science, Old Dominion University, Norfolk, VA 
23529-0162. 

Applicants from dual-career couples are wel- 
| come. Review of applicants will begin immediate- 
ly and continue until the positions are filled. 

Old Dominion University is an affirmative ac- 
tion, equal opportunity institution and requires 
compliance with the Immigration Reform and 
Control Act of 1986. 


Rochester Institute of Technology 
Computing and information sciences 
Faculty Openings for Fall 2010 


Rochester Institute of Technology’s B. Thomas 
Golisano College of Computing and Information 
Sciences (GCCIS) invites applications and nomi- 
nations for the following positions: 

A tenure-track faculty position in the Net- 
working, Security, and Systems Administration 
department (www.nssa.rit.edu) primarily in the 
area of network/computer security and informa- 
tion assurance. Additionally, experience in com- 
puter networking and/or systems administration 
is preferred (IRC #34428) 

A tenure-track faculty position in the Interac- 
tive Games and Media department (igm.rit.edu/ 
jobs) in the areas of computer games and interac- 
| tive entertainment (IRC#34427). 
| Successful candidates must demonstrate ex- 
cellence in teaching and scholarship and have 
the ability to contribute in meaningful ways to 
RIT’s commitment to cultural diversity and plu- 
| ralism. A Ph.D. in computing or a closely related 

field or other terminal degree (such as an MFA 

for IRC#34427) with equivalent experience is 
| required, as is a record of publicly disseminated 
| scholarly and/or creative work and demonstrated 
potential for future scholarship. 

The Golisano College of Computing and In- 
formation Sciences is home to the Computer 
Science, Software Engineering, Information Sci- 
ences and Technologies, Interactive Games & 
Media, and Networking, Security, and Systems 
Administration departments, as well as the Ph.D. 
program in Computer and Information Sciences. 
The college has 105 faculty and over 2400 under- 
graduate and 600 graduate students. 

Candidates should visit https://mycareer.rit. 
edu and refer to the IRC number listed above for 
specific information about the positions and 
the application process. Refer to www.rit.edu 
for information about RIT and the B. Thomas 
Golisano College of Computing and Informa- 
tion Sciences. 


Rutgers University 
Tenure-Track Position 


The Department of Computer Science at Rutgers 
University invites applications for tenure-track 
faculty positions at the rank of Assistant, Associ- 
ate or full Professor, with appointments starting 
in September 2010, subject to the availability of 
funds. All areas in experimental computer sys- 
tems will be considered, but special emphasis 
will be given to pervasive computing and com- 
puter architecture. 

Applicants for this research/teaching posi- 
tion must, at minimum, be in the process of com- 
pleting a dissertation in Computer Science or a 
closely related field, and should show evidence 
of exceptional research promise, potential for de- 
veloping an externally funded research program, 


and commitment to quality advising and teaching 


at the graduate and undergraduate levels. Hired 


candidates who have not defended their Ph.D. by 


September will be hired at the rank of Instructor, 
and must complete the Ph.D. by December 31, 
2010 to be eligible for tenure-track title retroac- 
tive to start date. 

Applicants should go to http://www.cs.rutgers. 
edu/employment/ and submit their curriculum 
vitae, a research statement addressing both past 
work and future plans and a teaching statement 
along with three letters of recommendation. 

Applications should be received by January 2, 
2010 for full consideration. 

Rutgers subscribes to the value of academic 
diversity and encourages applications from indi- 
viduals with varied experiences, perspectives, and 
backgrounds. Females, minorities, dual-career 
couples, and persons with disabilities are encour- 
aged to apply. 

Rutgers is an affirmative action/equal oppor- 
tunity employer. 


Santa Clara University 
Assistant Professor in Scientific 
Computation/Statistics 


The Department of Mathematics and Computer | 


Science at Santa Clara University invites applica- 
tions for a tenure-track Assistant Professor position 
starting in September 2010 from candidates with 
expertise in scientific computation or statistics and 
interest in mathematical biology, computational 
physics or chemistry, or environmental science. 
Application deadline is January 20, 2010. For 
more information regarding application submis- 
sions, see www.scu.edu/hr/careers/faculty.cfm 


Southern Polytechnic State University 
Tenure-track positions in Computing 


We are seeking to fill up to four tenure-track Assis- 
tant Professor positions (other ranks considered) 
in Information Technology, Computer Game De- 
sign and Development, and Software Engineer- 
ing, anda Lecturer position in Computer Science. 
For the tenure-track positions, applicants must 
hold a relevant Ph.D., be committed to excellence 
in teaching, and have substantial research poten- 
tial. SPSU is located in the metro Atlanta area, 
which offers excellent quality of life. For more 
details, including specializations of interest, go 
to http://www.spsu.edu/hr/faculty.html. 


Stanford University 

Department of Computer Science 
Faculty Opening - Assistant or untenured 
Associate Professor 


The Department of Computer Science at Stanford | 
University invites applications for a tenure-track 
faculty position at the junior level (Assistant or | 
untenured Associate Professor). We give higher | 
priority to the overall originality and promise of 
the candidate’s work than to the candidate’s sub- | 
area of specialization within Computer Science. 

We are seeking applicants from all areas of 
Computer Science, spanning theoretical founda- 
tions, systems, software, and applications. We 
are also interested in applicants doing research 
at the frontiers of Computer Science with other 
disciplines, especially those with potential con- 
nections to Stanford’s main multidisciplinary 
initiatives: Energy, Human Health, Environment 
and Sustainability, the Arts and Creativity, and 
the International Initiative. 

Applicants must have completed (or be com- 
pleting) a Ph.D., must have demonstrated the 
ability to pursue a program of research, and must 
have a strong commitment to graduate and un- 
dergraduate teaching. A successful candidate 
will be expected to teach courses at the graduate | 
and undergraduate levels, and to build and lead a 
team of graduate students in Ph.D. research. Fur- 
ther information about the Computer Science De- 
partment can be found at http://cs.stanford.edu. 
The School of Engineering website may be found 
at http://soe.stanford.edu. 

Applications should include a curriculum 
vita, brief statements of research and teaching in- 
terests, and the names of at least four references. 
Candidates are requested to ask references to 
send their letters directly to the search committee. 
Applications and letters should be sent to: Search 
Committee Chair, c/o Laura Kenny-Carlson, via 
electronic mail to search@cs.stanford.edu. 

The review of applications will begin on Janu- 
ary 4, 2010, and applicants are strongly encour- 
aged to submit applications by that date; however, 
applications will continue to be accepted until the 
position is filled, but no later than May 1, 2010. 

Stanford University is an equal opportunity 
employer and is committed to increasing the 
diversity of its faculty. It welcomes nominations 
of and applications from women and members 
of minority groups, as well as others who would 
bring additional dimensions to the university’s 
research and teaching missions. 


Stevens Institute of Technology 
Assistant Professor of Computer Science 


The Computer Science Department at Stevens 
Institute of Technology invites applications for a 
tenure-track position beginning in August 2010. 
Special consideration will be given to candidates 
in computer vision, computer graphics, and ma- 
chine learning at the assistant professor level. 
However, outstanding applicants at other levels 
and/or in other areas of Computer Science may 
also be considered. 

Applicants are expected to have a Ph.D. in 
Computer Science or a closely related field, a 
demonstrated record of excellence in research, 
and a strong commitment to teaching. A success- 
ful candidate will be expected to conduct a vig- 


DECEMBER 2009 


VOL. 52 


orous, funded research program and to teach at 
both the undergraduate and graduate levels. 

Stevens Institute of Technology is a private 
university located in Hoboken, New Jersey. The 
55-acre campus is on the Hudson River across 
from midtown Manhattan within a few minutes 
from NYC via public transportation. Hoboken is 
a small upscale city, the residence of New Jersey’s 
governor, and the residence of choice for many 
professionals working in NYC. Faculty live in 
Hoboken, NYC, and in suburban communities in 
Northern New Jersey along commuter train lines 
to Hoboken and NYC. Stevens’ location offers 
excellent opportunities for collaborations with 
nearby universities such as NYU, Princeton, Co- 
lumbia, and Rutgers/DIMACS as well as industri- 
al research laboratories such as Bell Labs, AT&T 
Labs, IBM Research, Google New York, Siemens, 
and the Sarnoff Corporation. 

Applications should be submitted electroni- 
cally at http://www.cs.stevens.edu/Search. Ap- 
plications should include a curriculum vitae, 
teaching and research statements, and contact 
information for at least three references. Candi- 
dates should ask their references to send letters 
directly to the search committee. PDF is preferred 
for all application materials and reference letters. 
Further information is provided at the web site. 

Review of applications will begin on Decem- 
ber 1, 2009. 

Stevens is an Affirmative Action/Equal Oppor- 
tunity employer. 


Swarthmore College 
Computer Science Department 


Applications are invited for a two-year Visiting 
Assistant Professor position beginning August 
2010. 

Swarthmore College is a small, selective lib- 
eral arts college located in a suburb of Philadel- 
phia. The Computer Science Department offers 
majors and minors in computer science at the 
undergraduate level. Applicants must have teach- 
ing experience and should be comfortable teach- 
ing a wide range of courses at the introductory 
and intermediate level. We are particularly inter- 
ested in candidates who specialize in theory and 
algorithms or in systems areas, however, we will 
consider candidates from all areas of CS. A Ph.D. 
in CS by or near the time of appointment is pre- 
ferred (ABD is required). We expect to begin inter- 
viewing in early February 2010. 

See http://cs.swarthmore.edu/jobs for appli- 
cation submission information and more details 
about the position. Swarthmore College is an 
equal opportunity employer. Applications from 
women and members of minority groups are en- 
couraged. Applications will be accepted until the 
position is filled. 


The Ohio State University 
Assistant Professor 


The Department of Computer Science and Engi- 
neering (CSE), The Ohio State University, invites 
applications for two tenure-track positions at the 
Assistant Professor level. The positions are open 
to all CSE areas (artificial intelligence, graphics 
and animation, networking, software engineer- 
ing and programming languages, systems, and 
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theory) with priority consideration given to candi- 
dates in database systems and software engineer- 
ing & programming languages. 

Applicants should hold or be completing a 
Ph.D. in CSE or a closely related field, and have a 
commitment to and demonstrated record of ex- 
cellence in research as well as a commitment to 
excellence in teaching. 

To apply, please submit your application via 
the online database. The link can be found at: 

http://www.cse.ohio-state.edu/department/ 

positions.shtm] 


Review of applications will begin in January 
and will continue until the positions are filled. 

The Ohio State University is an Equal Op- 
portunity/Affirmative Action Employer. Women, 
minorities, or individuals with disabilities are en- 
couraged to apply. 


Toyota Technological Institute 
at Chicago (TTI-C) 
Faculty Positions at All Levels 


Toyota Technological Institute at Chicago (TTI-C) 
is a philanthropically endowed degree-granting 
institute for computer science located on the Uni- 
versity of Chicago campus. The Institute is expect- 
ed to soon reach a steady-state of 12 traditional 
faculty (tenure and tenure track), and 12 limited 
term faculty. Applications are being accepted in 
all areas, but we are particularly interested in: 

Theoretical computer science 

Speech processing 

Machine learning 

Computational linguistics 

Computer vision 

Scientific computing 

Programming languages 


Positions are available at all ranks, and we 
have a large number of limited term positions 
currently available. 

For all positions we require a Ph.D. Degree or 
Ph.D. candidacy, with the degree conferred prior 
to date of hire. Submit your application electroni- 
cally at: 

http://ttic.uchicago.edu/facapp/ 


Toyota Technological Institute at Chicago 
is an Equal Opportunity Employer 


UMBC 

University of Maryland Baltimore County 
An Honors University in Maryland 
Information Systems Department 


The Information Systems Department at UMBC 
invites applications for a tenure-track faculty po- 
sition at the Assistant Professor level in the area 
of human-centered computing (HCC) starting 
August 2010. Outstanding candidates in other ar- 
eas will also be considered. 

Candidates must have an earned PhD in Infor- 
mation Systems or a related field no later than Au- 
gust 2010. Applicants engaged in interdisciplinary 
research addressing accessibility (broadly defined 
to include issues associated with disabilities, age, 
culture, and other relevant topics), human-infor- 
mation interaction, or social computing are of pri- 
mary interest. Ideal candidates will be engaged in 
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research that spans two or more of these areas with 
preference given to those who can collaborate with 
current faculty. Candidates should have a strong 
potential for excellence in research, the ability to 
develop and sustain an externally funded research 
program, and the ability to contribute to our grad- 
uate and undergraduate teaching mission. 

The Department offers undergraduate de- 
grees in Information Systems and Business Tech- 
nology Administration as well as both the MS and 
PhD in Information Systems. In addition, the 
Department offers an MS and PhD in Human- 
Centered Computing, and was ranked 8th in the 
nation in 2007 for scholarly productivity by Aca- 
demic Analytics. Consistent with the UMBC vi- 
sion, the Department has excellent technical sup- 
port and teaching facilities as well as outstanding 
laboratory space and state of the art technology. 
UMBC’s Technology Center, Research Park, and 
Center for Entrepreneurship are major indicators 
of active research and outreach. Further details 
on our research, academic programs, and faculty 
can be found at http://www.is.umbc.edu/. Under- 
represented groups including women and minor- 
ities are especially encouraged to apply. 

Applications will not be reviewed until the 
following materials are received: a cover letter, a 
one-page statement of teaching interests, a one- 
page statement of research interests, one or more 
sample research papers, and a curriculum vitae. 
Applicants should also arrange to have three let- 
ters of recommendation sent to the department 
as soon as possible. Electronic submission of 
materials as PDF documents is preferred. Elec- 
tronic copies should be sent to bmorris@umbce. 
edu. Copies can also be sent to: Dr. Andrew Sears, 
Chair of Faculty Search Committee, Information 
Systems Department, UMBC, 1000 Hilltop Circle, 
Baltimore, MD 21250-5398. For inquiries, please 
contact Barbara Morris at (410) 455-3795 or bmor- 
ris@umbc.edu. 

Review of applications will begin immediately 
and will continue until the position is filled. This 
position is subject to the availability of funds. 

UMBC is an Affirmative Action/Equal Opportu- 
nity Employer and welcomes applications from mi- 
norities, women and individuals with disabilities. 


University of Calgary 
Assistant Professor, Information Security 


The Department of Computer Science at the Uni- 
versity of Calgary seeks an outstanding candidate 
for a tenure-track position at the Assistant Profes- 
sor level, in the Information Security area. Details 
for the position appear at: www.cpsc.ucalgary.ca/ 
career. Applicants must possess a doctorate in 
Computer Science or a related discipline at the 
time of appointment, and have a strong potential 
to develop an excellent research record. 

The Department of Computer Science is one 
of Canada’s leaders as evidenced by our commit- 
ment to excellence in research and teaching. It 
has an expansive graduate program and extensive 
state-of-the-art computing facilities. Calgary is a 
multicultural city that is the fastest growing city 
in Canada. Calgary enjoys a moderate climate 
located beside the natural beauty of the Rocky 
Mountains. Further information about the De- 
partment is available at www.cpsc.ucalgary.ca/. 

Interested applicants should send a CV, a con- 
cise description of their research area and program, 
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a statement of teaching philosophy, and arrange to 
have at least three reference letters sent to: 

Dr. Ken Barker 

Department of Computer Science 

University of Calgary 

Calgary, Alberta, Canada, T2N 1N4 

Or via email to: search@cpsc.ucalgary.ca 


The applications will be reviewed beginning 
November 2009 and will continue until the posi- 
tion is filled. 

All qualified candidates are encouraged to ap- 
ply; however, Canadians and permanent residents 
will be given priority. The University of Calgary re- 
spects, appreciates, and encourages diversity. 


University of Delaware 

Software Engineering Faculty Position 
Department of Computer and Information 
Sciences Bioinformatics and Computational 
Biology 

Faculty Position Center for Bioinformatics & 
Computational Biology 


Applications are invited for a tenure-track fac- 
ulty position in Software Engineering to begin 
Fall 2010. More information is available at http:// 
www.cis.udel.edu/SEposition. 

Applications are invited for a tenure-track as- 
sistant professor position to begin Fall 2010. We 
seek applicants from any area of bioinformatics 
and computational biology, but encourage appli- 
cants with interests in computational genomics. 
More information is available at http://bioinfor- 
matics.udel.edu/facultysearch. 


University of Houston-Downtown 
Assistant Professor 


The University of Houston-Downtown invites 
applications for a tenure-track Assistant Profes- 
sor position in Computer Science starting Fall 
2010. Successful candidates must have a PhD in 
Computer Science or closely related field in hand 
by the time of appointment, a promising research 
profile, and a commitment to excellence in teach- 
ing. For more information about the position 
please visit http://jobs.uhd.edu. Only online ap- 
plications will be forwarded to the hiring commit- 
ting for review. AA/EOE 


The University of Michigan, Ann Arbor 
Department of Electrical Engineering and 
Computer Science 

Computer Science and Engineering Division 
Faculty Position 


Applications and nominations are solicited for a 
faculty position in the Computer Science and En- 
gineering (CSE) Division as part of an interdisci- 
plinary cluster hire funded by the University Pres- 
ident to strengthen expertise in the area of Data 
Mining, Learning, and Discovery with Massive 
Datasets across multiple departments. Expertise 
is particularly sought in visual analytics and infor- 
mation visualization. 

Candidates with a focus in this area are encour- 
aged to apply. However, all computer science and 
engineering applications will be considered. Ap- 
plications must be received by January 11, 2010. 


Qualifications include an outstanding aca- 
demic record, a doctorate or equivalent in com- 
puter engineering or computer science, and a 
strong commitment to teaching and research. 

To apply please complete the form at: http:// 
www.eecs.umich.edu/eecs/jobs/csejobs.html 

Electronic applications are strongly preferred, 
but you may alternatively send resume, teach- 


ing statement, research statement and names of | 


three references to: 

Professor Karem A. Sakallah, Chair, CSE 
Faculty Search 

Department of Electrical Engineering and 
Computer Science 

University of Michigan 

2260 Hayward Street 

Ann Arbor, MI 48109-2121 


The University of Michigan is a Non-Discrimina- 
tory/Affirmative Action Employer with an Active 
Dual-Career Assistance Program. The college is 
especially interested in candidates who can con- 
tribute, through their research, teaching, and/or 
service, to the diversity and excellence of the aca- 
demic community. 


University of North Texas 
Department of Computer Science and 
Engineering 

Department Chair 


Applications and nominations are invited for 
the Chair position in the Department of Com- 
puter Science and Engineering at the University 
of North Texas. Candidates must have an earned 
doctorate in Computer Science and Engineering 
or a closely related field with a record of signifi- 
cant and sustained research funding and schol- 
arly output that qualifies them to the rank of full 
professor. Candidates must also demonstrate a 
record of teaching, research accomplishments, 
and professional leadership. Preferred: Adminis- 
trative experience as a department chair or direc- 
tor of personnel working in computer science and 
engineering; experience in curriculum develop- 
ment; and demonstrated experience mentoring 
junior faculty. A record of strategic planning and 
organizational adaptation as well as knowledge 
of academic standards and procedures required 
of accrediting agencies is also preferred. 

The committee will begin its review of the ap- 
plications on November 1, 2009 and will continue 
until the search is closed. For additional informa- 
tion and to apply please visit: http://facultyjobs. 
unt.edu/applicants/Central? quickFind=50503. 

Additional information about the department 
is available at www.cse.unt.edu. 

UNT is an AA/ADA/EOE. 


University of Rochester 
Assistant to Full Professor of Computer 
Science 


The UR Department of Computer Science seeks 
applicants for a tenure-track position for 2010. 
Candidates in computer vision, machine learning, 
networks, security, or algorithms are of particular 
interest, but strong applicants from all areas of com- 
puter science are welcome. Candidates must have 
a PhD in computer science or related discipline. 
Senior candidates should have an extraordinary re- 


cord of scholarship, leadership, and funding. 

The Department of Computer Science is one 
of the best small, research-oriented departments 
in the nation, with an unusually collaborative cul- 
ture and strong ties to cognitive science, linguis- 
tics, and electrical and computer engineering. 
Over the past decade, a third of its PhD graduates 
have won tenure-track faculty positions, and its 
alumni include leaders at major research labora- 
tories such as Google, Microsoft, and IBM. 

The University of Rochester is a private, Tier I 
research institution located in western New York 
state. The University of Rochester consistently 
ranks among the top 30 institutions, both public 
and private, in federal funding for research and 
development. Half of its undergraduates go on 
to post-graduate or professional education. The 
university includes the Eastman School of Music, 
a premiere music conservatory, and the Univer- 
sity of Rochester Medical Center, a major medical 
school, research center, and hospital system. The 
Rochester area features a wealth of cultural and 
recreational opportunities, excellent public and 
private schools, and a low cost of living. 

Candidates should apply online at http:// 
www.cs.rochester.edu/recruit. Review of appli- 
cations will begin on Dec. 1, 2009, and continue 
until all interview openings are filled. The Uni- 
versity of Rochester has a strong commitment 
to diversity and actively encourages applications 
from candidates from groups underrepresented 
in higher education. The University is an Equal 
Opportunity Employer. 


University of South Carolina 
Department Chair - Computer Science and 
Engineering 

The Department of Computer Science and 
Engineering 


Department Chair - Computer Science and En- 
gineering The Department of Computer Science 
and Engineering (www.cse.sc.edu) in the Col- 
lege of Engineering and Computing, University 
of South Carolina, seeks nominations and ap- 
plications for the position of Department Chair. 
The Department offers bachelor’s degrees in 
Computer Engineering, Computer Information 
Systems, and Computer Science, M.S., M.E. and 
Ph.D. degrees in Computer Science and Engi- 
neering, a Master of Software Engineering, and 
a Certificate of Graduate Studies in Information 
Assurance and Security. This is an active and en- 
gaged Department with 21 faculty members, in- 
cluding 20 with current research funding and 8 
NSF CAREER award winners. Enrollment is over 
300 undergraduate and 90 graduate students, in- 
cluding more than 50 doctoral students. 

Applicants must have outstanding leadership 
and administrative skills, and credentials (in- 
cluding a Ph.D. in computer science, computer 
engineering, or related field) commensurate 
with appointment as a full professor with tenure. 
Nomination letters should include statements 
regarding the nominee’s relevant credentials. Ap- 
plicants should submit a current resume, a state- 
ment of professional interests and vision, and the 
names, affiliations, and contact information of 
professional references. Applications will be ac- 
cepted until the position is filled and should be 
sent by email to 

cse-chair-search@cec.sc.edu. 
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The Department is particularly interested in 
receiving applications from minorities and wom- 
en. The University of South Carolina is an affirma- 
tive action, equal opportunity employer. 


The University of Texas at San Antonio 
Faculty Positions in Computer Science 


The Department of Computer Science at The 
University of Texas at San Antonio invites appli- 
cations for tenure/tenure-track positions at the 
Assistant, Associate or Professor level, starting 
Fall 2010. All areas of computer science will be 
considered. We are particularly interested in can- 
didates in systems software and algorithms. 

Required qualifications: Applicants for an As- 
sistant Professor position must have earned a Ph.D. 
prior to September 1, 2010, in Computer Science or 
ina related field and must demonstrate a strong po- 
tential for excellence in research and teaching. 

Applicants for an Associate Professor position 
must have a Ph.D. in Computer Science or related 
field and should have an established research 
program in their area of specialization. 

Applicants for a Full Professor position must 
have a Ph.D. in Computer Science or a related 
field and previous experience in graduate and 
undergraduate teaching and academic program 
development as well as a recognized and well- 
funded program of research. 

Responsibilities include research, teaching at 
the graduate and undergraduate levels, and pro- 
gram development. Salary and start-up supporting 
packages for the positions are highly competitive. 

The Department of Computer Science cur- 
rently has 24 faculty members and offers B.S., 
M.S., and Ph.D. degrees supporting a dynamic 
and growing program with 450 undergraduate 
and 120 graduate students. The research activi- 
ties and experimental facilities have been well- 
supported by various federal research and infra- 
structure grants. See http://www.cs.utsa.edu for 
additional information on the Department of 
Computer Science and its faculty. 

With over 29,000 students UTSA is the largest 
university in South Texas. The city of San Antonio 
has a population of over one million and is known 
for its rich Hispanic culture, historic attractions, 
affordable housing, and excellent medical facili- 
ties. The Austin-San Antonio corridor is a high-tech 
center that serves as the home of many major com- 
puter companies. Nearby higher education and re- 
search institutions include the UT Health Science 
Center and the Southwest Research Institute. 

Applicants must submit a letter of applica- 
tion that identifies the level(s) of the position for 
which they wish to be considered. Applications 
must also include a complete dated curriculum 
vitae (including employment, peer-reviewed pub- 
lications and grants in chronological order), a 
statement of research interests, and the names, 
addresses (postal and e-mail), and telephone 
numbers of at least three references. Applicants 
who are selected for interviews must be able to 
show proof that they will be eligible and qualified 
to work in the US by the time of hire. These posi- 
tions are pending budget approval. Screening of 
applications will begin on January 15, 2010 and 
will continue until all positions are filled or the 
search is closed. The University of Texas at San 
Antonio is an Affirmative Action/Equal Opportu- 
nity Employer. Women, minorities, veterans, and 
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individuals with disabilities are encouraged to 
apply. Applications must be submitted by email | 
in PDF format to fsearch@cs.utsa.edu. 
Chair of Faculty Search Committee | 
Department of Computer Science | 
The University of Texas at San Antonio 
One UTSA Circle 
San Antonio, TX 78249-0667 
Phone: 210-458-4436 Fax: 210-458-4437 
fsearch@cs.utsa.edu 


University of Waterloo 
David R. Cheriton School of Computer Science 
Cheriton Chairs in Software Systems 


Applications are invited for one or two David R. 
Cheriton Chairs in Software Systems. These are 
senior positions and include substantial research 
support and teaching reduction. Candidates with 
outstanding research records in software systems 
(very broadly defined) are encouraged to apply. | 
Successful applicants who join the University of 
Waterloo are expected to be leaders in research, 
have an active graduate student program and con- 
tribute to the overall development of the School. 
A Ph.D. in Computer Science, or equivalent, is 
required, with evidence of excellence in teaching 
and research. Rank and salary will be commen- 
surate with experience, and appointments are 
expected to commence during the 2010 calendar 
year. The Chairs are tenured positions. 

With over 70 faculty members, the University 
of Waterloo’s David R. Cheriton School of Com- 
puter Science is the largest in Canada. It enjoys 
an excellent reputation in pure and applied re- 
search and houses a diverse research program of 
international stature. Because of its recognized 
capabilities, the School attracts exceptionally 
well-qualified students at both undergraduate 
and graduate levels. In addition, the University 
has an enlightened intellectual property policy 
which vests rights in the inventor: this policy 
has encouraged the creation of many spin-off 
companies including iAnywhere Solutions Inc., 
Maplesoft Inc., Open Text Corp and Research in 
Motion. Please see our website for more informa- 
tion: http://www.cs.uwaterloo.ca. 

To submit an application, please register at 
the submission site: http://www.cs.uwaterloo.ca/ 
faculty-recruiting. Once registered, instructions 
will be provided regarding how to submit your ap- 
plication. Applications will be considered as soon 
as possible after they are complete, andaslongas | 
positions are available. | 

The University of Waterloo encourages appli- | 
cations from all qualified individuals, including 
women, members of visible minorities, native 
peoples, and persons with disabilities. All quali- 
fied candidates are encouraged to apply; however, 
Canadian citizens and permanent residents will 
be given priority. 


University of Waterloo | 
David R. Cheriton School of Computer Science 
Faculty Position in Software Engineering 


a tenure-track or tenured faculty position in the 
David R. Cheriton School of Computer Science, in 
the area of Software Engineering. Candidates at 


| 
The University of Waterloo invites applications for 
all levels of experience are encouraged to apply. | 


108 COMMUNICATIONS OF THE ACM = DECEMBER 


2009 


Successful applicants who join the University of | 


Waterloo are expected to develop and maintain a 
productive program of research, attract and de- 
velop highly qualified graduate students, provide 
a stimulating learning environment for under- 
graduate and graduate students, and contribute 
to the overall development of the School. A Ph.D. 
in Computer Science, or equivalent, is required, 
with evidence of excellence in teaching and re- 
search. Rank and salary will be commensurate 
with experience, and appointments are expected 
to commence during the 2010 calendar year. 

With over 70 faculty members, the University 
of Waterloo’s David R. Cheriton School of Com- 
puter Science is the largest in Canada. It enjoys 
an excellent reputation in pure and applied re- 
search and houses a diverse research program of 
international stature. Because of its recognized 
capabilities, the School attracts exceptionally 
well-qualified students at both undergraduate 
and graduate levels. In addition, the University 
has an enlightened intellectual property policy 
which vests rights in the inventor: this policy 
has encouraged the creation of many spin-off 
companies including iAnywhere Solutions Inc., 
Maplesoft Inc., Open Text Corp and Research in 
Motion. Please see our website for more informa- 
tion: http://www.cs.uwaterloo.ca. 

To submit an application, please register at 
the submission site: http://www.cs.uwaterloo.ca/ 
faculty-recruiting. Once registered, instructions 
will be provided regarding how to submit your ap- 
plication. Applications will be considered as soon 
as possible after they are complete, and as long as 
positions are available. 

The University of Waterloo encourages appli- 
cations from all qualified individuals, including 
women, members of visible minorities, native 
peoples, and persons with disabilities. All quali- 
fied candidates are encouraged to apply; however, 
Canadian citizens and permanent residents will 
be given priority. 


University of Waterloo 
David R. Cheriton School of Computer Science 
Faculty Positions in Health Informatics 


The University of Waterloo invites applications for 
one or two tenure-track or tenured faculty posi- 
tions in the David R. Cheriton School of Computer 
Science, in the area of Health Informatics. We de- 
fine health informatics broadly to include medical 
informatics and biomedical systems. The School 
plans to start a new graduate degree program in 
health informatics in September 2010. 

Candidates at all levels of experience are en- 
couraged to apply. Successful applicants who 
join the University of Waterloo are expected to 
develop and maintain a productive program of 
research, attract and develop highly qualified 
graduate students, provide a stimulating learn- 
ing environment for undergraduate and graduate 
students, and contribute to the overall develop- 
ment of the School. A Ph.D. in Computer Science, 
or equivalent, is required, with evidence of excel- 
lence in teaching and research. Rank and salary 
will be commensurate with experience, and ap- 
pointments are expected to commence during 
the 2010 calendar year. 

With over 70 faculty members, the University 
of Waterloo’s David R. Cheriton School of Com- 
puter Science is the largest in Canada. It enjoys 
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an excellent reputation in pure and applied re- 
search and houses a diverse research program of 
international stature. Because of its recognized 
capabilities, the School attracts exceptionally 
well-qualified students at both undergraduate 
and graduate levels. In addition, the University 
has an enlightened intellectual property policy 
which vests rights in the inventor: this policy 
has encouraged the creation of many spin-off 
companies including iAnywhere Solutions Inc., 
Maplesoft Inc., Open Text Corp and Research in 
Motion. Please see our website for more informa- 
tion: http://www.cs.uwaterloo.ca. 

To submit an application, please register at 
the submission site: http://www.cs.uwaterloo.ca/ 
faculty-recruiting. Once registered, instructions 
will be provided regarding how to submit your ap- 
plication. Applications will be considered as soon 
as possible after they are complete, and as long as 
positions are available. 

The University of Waterloo encourages appli- 
cations from all qualified individuals, including 
women, members of visible minorities, native 
peoples, and persons with disabilities. All quali- 
fied candidates are encouraged to apply; however, 
Canadian citizens and permanent residents will 
be given priority. 


University of Wisconsin - Platteville 
Assistant Professor 


The Department of Computer Science and Soft- 
ware Engineering at the University of Wisconsin- 
Platteville invites applications for two tenure- 
track faculty positions starting fall, 2010. 

One position will be in Computer Science, the 
other in Software Engineering. Candidates apply- 
ing for the Software Engineering position must 
have a Ph.D. in Software Engineering, Computer 
Science, or closely related field. The primary duty 
for this position will be teaching courses in an 
ABET-accredited software engineering program. 
Candidates applying for the Computer Science 
position must have a Ph.D. in Computer Science 
or closely related field such as Information Sys- 
tems. The primary duty for this position will be 
teaching courses in Computer Science. 

Candidates must have excellent verbal and 
written communication skills and a strong com- 
mitment to teaching. Commitment to scholarly 
and professional activities is also required. The 
candidate must have a demonstrated commit- 
ment to or experience with racially diverse popu- 
lations. Salary will be competitive and commen- 
surate with experience and qualifications. In 
addition, there are consulting opportunities with 
local companies. 

Send a letter of application, undergraduate & 
graduate transcripts, a statement illustrating your 
commitment to fostering campus racial diversity, 
and avita including references to Search and Screen 
Committee, Department of Computer Science and 
Software Engineering, 207 Ullrich Hall, 1 University 
Plaza, Platteville, WI 53818-3099 or to stutenbm@ 
uwplatt.edu. Visit our web site at http://www.uw- 
platt.edu/~csse for more information. Review of 
applications will start January 4, 2010 and will con- 
tinue until suitable candidates are found. 

UW-Platteville, an equal opportunity, affirma- 
tive action employer, seeks to build a diverse faculty 
and staff and encourages applications from women 
and persons of color. The names of all nominees 


and applicants who have not requested in writing 
that their identities be kept confidential, and of all 
finalists, will be released upon request. Employ- 
ment will require a criminal background check. 


cant screening will begin January 15, 2010 and 
continue until the position is filled. Inquiries 
should be directed to Dennis Kafura, Hiring Com- 
mittee Chair, kafura@cs.vt.edu. 


University of Wisconsin — Milwaukee 
Health Informatics Professor 


9-month tenure-track Assistant, Associate or Full 
Professor. Program planning and management, 
teaching, advising students, developing and 
maintaining an active line of research and seeking 
extramural funding. Apply URL: http://www.jobs. 
uwm.edu/applicants/Central?quickFind=50785 


Ursinus College 
Assistant Professor of Computer Science 


Ursinus College, a liberal arts college located in 
the suburbs of Philadelphia, invites applications 
for a tenure-track position in Computer Science. 
Please see http://www.ursinus.edu/NetCommu- 
nity/Page.aspx?pid=390 for more information. 


Virginia Tech 

Department of Computer Science 
Artificial Intelligence/Machine Learning 
Senior Position 


The Department of Computer Science at Vir- 
ginia Tech (www.cs.vt.edu) invites applications 
for a full-time tenured position at the Professor 
or Associate Professor rank from candidates in 
artificial intelligence with particular interests in 
machine learning, knowledge representation, 
or data mining. Candidates should have an es- 
tablished record of scholarship, leadership, and 
collaboration in computing and interdisciplin- 
ary areas; demonstrated ability to contribute to 
teaching at the undergraduate and graduate lev- 
els in Al and related subjects; sensitivity to issues 
of diversity in the campus community; and the 
skills needed to establish and grow a multidisci- 
plinary research group. 

CS@VT has over 40 tenure-track research-ori- 
ented faculty. PhD production is among the top 
30 in the US and annual research expenditures 
exceed $6 million. There are rich opportunities in 
a highly collaborative department with strengths 
in HCI, HPC, CS education, digital libraries, com- 
putational biology and bioinformatics. Active 
interdisciplinary research also explores Cyber- 
Arts, digital government, problem-solving envi- 
ronments. Emphases on security and personal 
health informatics are underway in collaboration 
with the newly formed VT-Carilion Research In- 
stitute associated with the VT-Carilion School of 
Medicine, opening in Fall 2010. 

CS@VT is part of the College of Engineering 
(www.eng.vt.edu) in a comprehensive research 
university with more than 26,000 students. The 
main campus is in Blacksburg, which is consis- 
tently ranked among the country’s best places to 
live (http://www. liveinblacksburg.com/). 

Salary for suitably qualified applicants is com- 
petitive and commensurate with experience. Vir- 
ginia Tech is an Equal Opportunity/Affirmative 
Action Institution. 

Applications must be submitted online to 
https://jobs.vt.edu for posting #090529. Appli- 


Washington University in Saint Louis 
Multiple tenure-track/tenured faculty positions 


The Department of Computer Science and Engi- 
neering (CSE) and the School of Medicine (WUSM) 
are jointly searching for multiple tenure-track 
faculty members with outstanding records of 
computing research and a serious interest in col- 
laborative research on problems related to biol- 
ogy and/or medicine. Appointments may be made 
wholly within CSE or jointly with the Departments 
of Medicine or Pathology & Immunology. 

A key initiative in the CSE Department’s stra- 
tegic plan is Integrating Computing and Science. 
As part of that initiative, we expect to make syn- 
ergistic hires with a combined research portfolio 
spanning the range from fundamental computer 
science/engineering to applied research focused 
on science or medicine. Specific areas of interest 
include, but are not limited to: 
> Analysis of complex genetic, genomic, pro- 
teomic, and metabolomic datasets; 
> Theory/Algorithms with the potential for bio- 
medical applications; 
> Image analysis or visualization with the poten- 
tial for biomedical applications; 
>» Databases, medical informatics, clinical or 
public-health informatics; 
>» Computer engineering with applications to 
medicine or the natural sciences; 

» All areas of computational biology and biomed- 
ical informatics 


These positions will continue a successful, 
ongoing strategy of collaborative research be- 
tween CSE and the School of Medicine, which 
is consistently ranked among the top 3 medi- 
cal schools in the United States. CSE currently 
consists of 24 tenured and tenure-track faculty 
members, 71 Ph.D. students, and a stellar group 
of undergraduates with a history of significant 
research contributions. The Department seeks to 
build on and complement its strengths in biologi- 
cal sequence analysis, biomedical image analysis, 
and biomedical applications of novel computing 
architectures. Exceptional candidates conduct- 
ing research in other areas of Computer Science 
are also encouraged to apply. 

Washington University is a private university 
with roughly 6,000 full-time undergraduates and 
6,000 graduate students. It has one of the most at- 
tractive university campuses anywhere, and is locat- 
ed ina lovely residential neighborhood, adjacent to 
one of the nation’s largest urban parks, in the heart 
of a vibrant metropolitan area. St. Louis is a won- 
derful place to live, providing access to a wealth of 
cultural and entertainment opportunities without 
the everyday hassles of the largest cities. 

We anticipate appointments at the rank of 
Assistant Professor; however, in the case of ex- 
ceptionally qualified candidates appointments 
at any rank may be considered. Applicants must 
have a Ph.D. in computer science, computer en- 
gineering, electrical engineering, biomedical en- 
gineering, or a closely related field and a record 
of excellence in teaching and research appropri- 
ate to the appointment level. The selected candi- 
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date is expected to build an externally-supported 
research program, teach and mentor students at 
the graduate and undergraduate levels, and fos- 
ter interdisciplinary interactions with colleagues 
throughout the university. Candidates who would 
contribute to enhancing diversity at the depart- 
mental and university levels are strongly encour- 
aged to apply. Applications from academic cou- 
ples are welcomed and encouraged. 

Qualified applicants should submit a com- 
plete application (cover letter, curriculum vita, 
research statement, teaching statement, and 
names of at least three references) electronically 
by following the directions provided at http://cse. 
wustl.edu/faculty-recruiting/. Other communica- 
tions may be directed to Prof. Michael Brent, De- 
partment of Computer Science and Engineering, 
Campus Box 1045, Washington University, One 
Brookings Drive, St. Louis, MO 63130-4899. 

Applications submitted before January 31, 
2010 will receive full consideration. Washington 
University is an equal opportunity/affirmative ac- 
tion employer. 


Wellesley College 

Computer Science Department 
Norma Wilentz Hess Visiting Assistant 
Professor 


Wellesley College invites applications for a two- 
year Norma Wilentz Hess Fellowship in the De- 
partment of Computer Science. Funded at the 
Visiting Assistant Professor level, this fellowship 
will help Wellesley maintain a flexible Computer 
Science curriculum that explores interdisciplin- 
ary learning and new directions of special prom- 
ise. With a teaching load of one course per semes- 
ter, the Hess Fellow will have ample opportunity 
for innovative course development and collabora- 
tive teaching and research projects with Wellesley 
faculty and students. The fellowship also includes 
support for travel, research, equipment, confer- 
ence attendance, and other academic activities. 
For more information about the department and 
the college, please visit http://cs.wellesley.edu. 

Applicants should have a Ph.D (or be close 
to completion) in Computer Science or a related 
discipline. Strong candidates in any area of spe- 
cialty will be considered. We especially encour- 
age applicants in interdisciplinary fields such as 
artificial intelligence, bioinformatics, human/ 
computer interaction, and ethical, educational, 
social, and legal aspects of computing. 

Interested applicants are requested to sub- 
mit a letter of application, curriculum vitae, and 
names and email addresses of three references 
through our on-line application system at https:// 
career.wellesley.edu. Include a statement dis- 
cussing teaching philosophy and ideas for course 
development and research projects. Applications 
will be reviewed starting January 11, 2010. If cir- 
cumstances make it impossible to submit any 
materials electronically, you may email work- 
ing@wellesley.edu for assistance. Questions con- 
cerning the position should be directed by email 
to Randy Shull at rshull@wellesley.edu. 

Wellesley College is an Affirmative Action/ 
Equal Opportunity Employer, and we are com- 
mitted to increasing the diversity of the college 
community and the curriculum. Candidates who 
believe they can contribute to that goal are en- 
couraged to apply. 
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Indiana University 
School of Informatics and Computing 
Faculty Positions siasnesein Perna 


The School of Informatics and Computing / bd e e + 
at Indiana University, Bloomington, invites | Ita | rar 
applications for faculty positions. We are } 


particularly interested in tenure-track 
candidates in the areas of: — ee 

¢ Health Informatics | 

* Cyber-infrastructure W W W ° a C mM ° O r g /d | 

* Software and Systems 

* Data Mining, Machine Learning 

and Search ¢ Advanced Search 

« Artificial Intelligence Full text of every article ever published by ACM. + Browse the Digital Library: 

* Human-Computer Interaction «USI Te ANSE Uigital Lirary were 

* Graphics and Visualization; and 2 Questions (FAN'S 

* Algorithms 


Most positions are expected to be filled 


Recently loaded issues and proceedings: 
(available in the DL. within the past 2 weeks) y Affiliated Organizations 
Special Interest Groups (SIGs) 


2s (SIGS 
ri fi fi f J 
at the junior level but outstanding senior ieee J 
candidates will be considered, particularly Journal on Educational Resources in Computing Pain Services: Loin reauited 
j H JERIC) My Binders 
in the broad area of cyber-infrastructure, * oadieand Sis camila, Sie dos 
software and systems. ACM Transactions on Applied Perception (TAP) @ with colleagues and build bibliographies, 
j Vol ZIssue OC Service 
The 1U Bloomington School of Informat- eee Sa 
ics and Computing is the first of its kind ON Manca ane af Sofware engineering ang issues or proceedings become available 
and among the largest in the country, with $< Feedback ae ea 


a faculty of more than 60 full time mem- ‘ _a problem a et vata Ue reste 

bers, more than 400 graduate students, and Bd ction surve — searchabilty 

widely subscribed undergraduate programs. | | c 5g 

It offers undergraduate degrees in Computer ‘hevene feos Saas eee 

Science and in Informatics, M.S. degrees in SY ; Uap 

Computer Science, Bioinformatics, Human 

Computer Interaction Design, and Security - . 

Informatics, and Ph.D. degrees in Computer 

Science and in Informatics. The School has | | The U Iti mM ate O n i i n © 

received public recognition as a “top-ten pro- 

gram to watch” (Computerworld) thanks to | | 

its excellence and leadership in academic j N FO RMATIO N TECH N O LOGY 
programs, interdisciplinary research, place- |_| 

ment, and outreach. The school offers ex- R | 
cellent work conditions, including attractive | _ eso U rce e 
salaries and research support, low teaching | | 

loads, and world-class computing and library . . 
facilities. The school continues strong stu- - Over 40 ACM publications, plus conference 


dent growth, over 30% in the last two years. | | proceedings 


The school hired 5 new faculty last year and | | . 
plans to hire a similar number this year. ¢ 50+ years of archives 


Located in the wooded, rolling hills of | e Advanced searching capabilities 
southern Indiana, Bloomington is a culturally eras 
thriving college town with a moderate cost ¢ Over 2 million pages of downloadable text 
of living. It is renowned for its top-ranked 


musi school, performing and fine arts, | Plus over one million bibliographic 


historic campus, cycling traditions, active 


lifestyle, and natural beauty. H H H j 
Applicants should submit a curriculum citations are available in the 


vitae, a statement of research and teaching, ACM Guide to Com puting Literature 
and the names of three references (six for 
associate and full professor candidates) 


USING: Ene A BTEIE INK ae RMA iver Manes. To join ACM and/or subscribe to the Digital Library, contact ACM: 
indiana.edu/hiring (preferred) or by mail to: 


Chair, Faculty Search Committee | Phone: 1.800.342.6626 (U.S. and Canada) 


School of Informatics and Computing +1.212.626.0500 (Global) 
919 E 10th Street ; reat rasa 
Bloomington, IN 47408 Fax: +1.212.944.1318 


Questions may be sent to: Hours: 8:30 a.m.-4:30 p.m., Eastern Time 


hiring@informatics.indiana.edu. Email: acmhelp@acm.org 


To receive full consideration completed Join URL: www.acm.org/joinacm 


applications must be received by December Sit. ; 
31, 2009. However, applications received Mail: ~~ ACM Member services 
after this date will be considered until all General Post Office 
positions are filled. PO Box 30777 
Indiana University is an Equal Opportunity/ New York, NY 10087-0777 USA 


Affirmative Action employer. Applications from 
women and minorities are strongly encouraged. 
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Solutions and Sources 


Last month (November 2009, p. 112) we posted a trio of brain teasers, 
including one as yet unsolved, concerning the covering of a plane. 


1 Identical Coins on 

= Rectangular Table. 

Solution. Let us observe that if we dou- 
ble the radius (say, from one to two 
inches) of each of the original coins, 
the result would be to cover the whole 
table. Why? Well, if a point P isn’t cov- 


ered, it must be two inches or more | 


from any coin center; thus a one-inch 
coin placed with its center at P would 
fit in the original configuration. 

If we could replace each big coin 
with four small coins covering the 
same area, we’d be done... but we can’t 
cover a big coin with four small coins. 
Instead, let us note that rectangles 
have the property that they can be par- 
titioned into four copies of themselves. 


So, we shrink the whole picture (of big | 


coins covering the table) by a factor of 
two in each dimension and use four 
copies of the new picture to cover the 
original table. 

Surprisingly (perhaps), this lovely 
but seemingly crude argument gives 
the best possible factor: Replace the 
factor 4 with anything smaller, say 3.99, 
and the statement of the puzzle (with 
100 and 400 replaced by n and 3.997) is 
no longer true. 

The puzzle came to me by way of 
computer scientist Guy Kindler of the 
School of Computer Science and En- 


gineering in the Hebrew University of | 


Jerusalem during a marvelous visiting 
year (2003-2004) by each of us at the 
Institute for Advanced Study in Princ- 
eton, NJ. 


2? Two Sets of Parallel Lines on 
ulnfinite Plane. 

Solution. The puzzle asked whether 
we could cover each point of the plane 
exactly twice using a set of lines that 
contains lines in more than two dif- 
ferent directions. 

The solution will disappoint some 
of us. The answer is yes, assuming the 
Axiom of Choice, which allows us to 
make choices at every step of an infi- 
nite process. But the proof presented 
here requires transfinite induction(!), 
leaving us without any geometry we 
can wrap our mind around. The prob- 
lem (and its solution) were provided to 
me by mathematical physicist Senya 
Shlosman of the Centre de Physique 
Theorique, Marseille, France, who is 
unaware of its origin. 

Nonetheless, the solution appeals 
to me as an example of an easy applica- 
tion of a powerful tool. The idea is that 
we start off with three lines that cross 
one another (so we already have three 
directions). Roughly speaking, at each | 
moment in our algorithm we have con- 
structed a finite or countable number 
of lines, with no point covered twice; 
we then pick a point on the plane that | 
is not doubly covered and add a line 
through that point. How do we know 
we can do this without triple-covering 
some other point? Because the number 
of pairs of lines so far constructed is 
countable, only countably many points 
are double-covered, and we have an un- 
countable number of angles at which 
the new line can be placed. 

Does this seem like cheating? It _ 
should. First, those of us who have | 
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used transfinite induction know that 
details have been omitted (but easily 
supplied). More significant, there is no 
way we can do this construction in a 
useful manner. Does this mean there 
isn’t some clever, effective, or at least 
imaginable way to double-cover the 


| plane with lines, other than by two par- 


allel classes? I haven’t found one. Nei- 
ther has Shlosman. Perhaps you can. 
3 Colored Discs on 

alnfinite Plane. 
Conjecture. This open puzzle sup- 
posed that we were given a collection 
of open-unit discs that is a thousand- 
fold cover of the plane; that is, every 
point of the plane is covered by at 
least 1,000 discs. Our job was to prove 
(or disprove) that we can color each 
disc red or blue in such a way that the 
red discs cover the plane and the blue 
discs also cover the plane. 

No solutions have been offered so 
far. Maybe this is just fundamentally 
very difficult to prove. Maybe it isn’t 
even true. But it seems reasonable, 
don’t you think? 


All readers are encouraged to submit prospective 
puzzles for future columns to puzzled@cacm.acm.org. 


Peter Winkler (puzzled@cacm.acm.org) is Professor of 
Mathematics and of Computer Science and Albert Bradley 
Third Century Professor in the Sciences at Dartmouth 
College, Hanover, NH. 
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| 1980, when the Apple II came out. I 


The computer reconfigures me. 


I'VE BEEN A writer for 40 years, and my 
attitude toward writing machines has 
never been simple. Right now I’m sit- 
ting in a quiet one-room cabin in the 
Maine woods, writing in natural light 
with a fountain pen into a bound blank 
book. I could have done that in the 19th 
century—or 5,000 years ago, if you trade 
the fountain pen for a quill or stylus. 

But when the day’s writing is done, I 
pull out my featherweight MacBook Air 
and type the words from the book into a 
text file. Then I port it over to NeoOffice 
to get a word count for the day. 


I have various reasons to make it | 


Mightier Than the Pen 


this complicated, but the main one is | 


simple: that hand-written draft is a true 
first draft. You can name a computer 
file “first draft,” but there’s no way to 
prove it was first. In fact, you would be a 
very strange writer if you didn’t go back 
and forth, changing things, long before 
you finished the draft. 

Why should that make any differ- 
ence? Well, as a practical matter, some- 
times you do get something right the 
first time. After a book’s gone through 
a half-dozen rewrites, it doesn’t hurt to 
look back and see whether the changes 
were all for the better. 

There’s also the archival and finan- 
cial value of the physical document it- 
self. Even for a moderately well-known 
writer like myself, the holographic 
(handwritten) manuscript of a novel is 
worth thousands or tens of thousands 
of dollars. I’ve twice put handwritten 
drafts of short stories up on eBay, and 
collectors paid more than I got from 
the magazine that printed them. In ad- 
dition, if you’re egotistical enough to 
want your work to last after you die, it’s 
nota bad strategy to leave behind lots of 
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material for scholars—so they’ll write 
papers on you and force their students 
to buy your books. 

I get a lot of esthetic, or at least craft, 
pleasure out of writing an actual book. 
In front of my window in this little cabin 
is a cup holding a dozen different foun- 
tain pens and four different bottles of 
ink. Each combination has its own per- 
sonality, and it’s fun to start out the day 
making that small decision. 

I’ve written about 20 books with this 
collaboration between pen and com- 
puter, and although I love my pens and 
blank books with hobbyist zeal, ifI had | 
to choose between them and the com- 
puter there would be no contest. The 
pens would have to go, even though 
they’re so familiar they’re like part of 
my hand. The computer is part of my 
brain. It has reconfigured me. 

When I started writing, there weren’t 
any out-of-the-box word processors with- 
in the budget of a freelance writer. (Ste- 
phen King had an industrial-strength 


ing down from the ceiling.) If you were 
good with hobbyist electronics, you 
could cobble something together from 
Ohio Scientific or Radio Shack. But like 
most writers, I had to wait until around | 


eee 
Sometime during the 
Age of Primates the 
internet got into the 
evolutionary stroll. 


VOL. 52 NO. 12 


bought one of the first, along with a 
printer about the size and weight ofa VW 
Beetle. It used Doctor Memory as a word 
processor; the letters were all scream- 
ing capitals, with actual caps showing in 
eye-straining white-on-black. 

By today’s standards it was crude 
and slow, and it took a half-dozen big 
fragile floppy disks to store a novel. But 
what a feeling of power. 

At least when it worked. You heard 
horror stories about a person touch- 
ing the wrong key and losing an entire 
novel. I once lost a chapter by hitting 
“Backspace” at the wrong time. I told 
my friends that this new machine was 
really great; no mere electric typewriter 
could leap off your desk into a filing 
cabinet and eat a chapter of your novel. 

One triumphant day I forced the 
top off the Apple I, I think voiding the 
warranty, and swapped out a couple of 
chips and soldered in a third-party com- 
ponent—and suddenly I had lower-case 
letters, black on white, just like a type- 
writer but infinitely malleable. From 
then on Iwas crawling up the beach, the 
fish gulping air becoming the lizard be- 
coming the mammal and then the ape. 
Sometime during the Age of Primates 
the Internet and Web got into the evolu- 
tionary stroll, and the computer was no 
more a fancy typewriter than a human 
is a fancy chimp. 

About 15 discarded machines litter 
that evolutionary beach, and it’s a long 
way from the Sinclair that whistled into 
a tape recorder to the superlight Frisbee 
of a Mac Air that talks to satellites and 
brings a huge and complex universe 
into my rustic little shack. 

Evolution doesn’t end. The ape be- 
came the human, and now the human 
is drawing pictures in the sand. 


Joe Haldeman (haldeman@earthlink.net) is an adjunct 
professor in the Program in Writing and Humanistic Studies 
at the Massachusetts Institute of Technology, Cambridge, 
MA. His most recent novel is Marsbound. His novel 
Camouflage won the Nebula Award for Best Novel in 2005. 


© 2009 ACM 0001-0782/09/1200 $10.00 


CONNECT WITH OUR 
COMMUNITY OF EXPERTS. 


www.reviews.com 
jill Association for 
& la They'll help you find the best new books 
Reviews.com and articles in computing. 


Computing Reviews is a collaboration between the ACM and Reviews.com. 


http://fdg2010.org 


the International Conference on the 
Foundations of Digital Games 


June 19 = 21 20] O 


Submission Deadlines 


eK 


Doctoral Consortium: 12th Feb 
Demos: 2nd April 


=) Andrew Fi 


Infrastructure (Databases, Se ae Ce 
Networks, Security) Graphics & Interfaces SN ti 


Artificial Intelligence Computer Science & Game Studies 


Games Education 


Conference Chair: lan Horswill, Northwestern University An Official Conference of the 
Program Chair: Yusuf Pisan, University of Technology, Sydney Society for the Advancement of 
Doctoral Consortium Chair: Zoran Popovic, University of Washington the Science of Digital Games 
Workshops Chair: Michael Mateas, University of California, Santa Cruz 

Panels Chair: lan Bogost, Georgia Institute of Technology SAS D G 
Tutorials Chair: Robin Hunicke, That Game Company 

Industrial Relations Chair: Hiroko Osaka, Northwestern University EDG 2010 Suppired by: 


Local Arrangements Chair: Marilyn Walker, University of California, Santa Cruz : 
Webmaster: Karl Cheng-Heng Fua, Northwestern University Microsoft 


es Research 


